PROCESS FOR TYPING OF HCV ISOLATES 



The invention relates to the use of probes targeting sequences from the 5'
untranslated region of HCV for genotyping of HCV isolates.

The invention also relates to a process for genotyping of HCV isolates. 
The invention also relates to a kit for genotyping of HCV isolates. 
Hepatitis C viruses (HCV) are a family of positive-stranded, enveloped 
RNA viruses causing the majority of non-A, non-B (NANB) hepatitis. Their 
genomic organization indicates a close relationship to the Pestiviridae and 
Flaviviridae. The sequences of cDNA clones covering the complete genome of 
several prototype isolates have already been completely determined (Kato et al.,
1990; Choo et al., 1991; Okamoto et al., 1991; Takamizawa et al., 1991; Okamoto
et al., 1992b). These genomes are about 9500 base pairs long. The isolates
reported by Kato, Takamizawa, and Choo contain an open reading frame (ORF) of 
3010 or 3011 amino acids, and those reported by Okamoto encode 3033 amino 
acids. Comparison of these isolates shows a considerable variability in the 
envelope (E) and non-structural (NS) regions, while the 5' untranslated region 
(UR) and, to a lesser extent, the core region are highly conserved. 

Using cloned sequences of the NS3 region, Kubo et al. (1989) compared
a Japanese and an American isolate and found nearly 80% nucleotide and 92%
amino acid homology. The existence of sequence variability was further
documented when sequences of the 5' UR, core, and E1 regions became available
(HC-J1 and HC-J4; Okamoto et al., 1990). After the isolation of several NS5
fragments in Japanese laboratories, two groups, K1 and K2, were described
(Enomoto et al., 1990). A comparison of the "American-like" isolate PT-1 with
K1, which was more prevalent in Japan, showed that they represent closely related
but different subtypes with an intergroup nucleotide identity of about 80%. The K2
sequence was more distantly related to both K1 and PT-1, because homologies of
only 67% at the nucleic acid level, and 72% at the amino acid level were observed.
Moreover, K2 could be divided into two groups, K2a and K2b, also showing
intergroup nucleotide homologies of about 80%. Nucleotide sequence analysis in
the 5' UR showed 99% identity between K1 and PT-1, and at most 94% identity
between K1 and K2, enabling the use of the 5' UR for restriction fragment length
polymorphism (RFLP) and classification of HCV into groups K1 and K2 (Nakao
et al., 1991). Further evidence for a second group was given by the complete
sequence of HC-J6 and HC-J8, two sequences related to the K2 group (Okamoto
et al., 1991; Okamoto et al., 1992b). A phylogenetic tree of HCV containing four
branches (i.e., Type I: HCV-1 and HCV-H; Type II: HCV-J, -BK, HC-J4; Type III:
HC-J6; Type IV: HC-J8) was proposed by Okamoto et al. (1992b). However,
nucleic acid sequence homologies of 79% can be observed between Type I and
Type II, and also between Type III and IV. A lesser degree of relatedness between 
the first group (Type I and II) and the second group (Types III and IV) of only 67- 
68% exists. Moreover, a new type of HCV, HCV-T, was detected in Thailand after 
studying NS5 regions (Mori et al., 1992). HCV-T had a sequence homology of
about 65% with all other known NS5 sequences, and two groups could be 
detected, HCV-Ta and HCV-Tb, which again exhibited nucleic acid sequence 
homologies of about 80%. Elucidation of the phylogenetic relationship of a similar 
new group found in British isolates with Type I to IV was possible by analyzing 
the conserved parts of the 5' UR, core, NS3, and NS5 regions (Chan et al., 1992a).
A new phylogenetic tree was proposed, whereby 'type 1' corresponds with Type I
and II, 'type 2' with Type III and IV, and 'type 3' with their own isolates E-b1 to
E-b8 and HCV-T. Some sequences of the 5' UR of isolates from 'type 3' were also
reported by others (Bukh et al., 1992; Cha et al., 1992; Lee et al., 1992).

Several patent applications have addressed the problem of detecting the 
presence of HCV by means of probes derived from the genome of type 1 HCV 
isolates (WO 92/02642, EP 419 182, EP 398 748, EP 469 438 and EP 461 863). 
Furthermore, the 5' UR of HCV isolates has been proven to be a good candidate 
for designing probes and primers for general HCV detection (Cha et al., 1991;
Inchaupse et al., 1991). However, none of these patent applications presents a
method for identifying the type and/or subtype of HCV present in the sample to be 
analyzed. 

The demonstration that different HCV genotype infections resulted in
different serological reactivities (Chan et al., 1991) and responses to interferon
IFN-γ treatment (Pozatto et al., 1991; Kanai et al., 1992; Yoshioka et al., 1992)
stresses the importance of HCV genotyping. Until now, this could only be
achieved by large sequencing efforts in the coding region or in the 5' UR, or by
polymerase chain reactions (PCR) on HCV cDNA with type-specific sets of core
primers (Okamoto et al., 1992a), or by RFLP analysis in the 5' UR or in the NS5
region (Nakao et al., 1991; Chan et al., 1992b). However, none of the
above-mentioned patent applications or publications offers a reliable method for
identifying the type or subtype of HCV present in the sample to be analyzed,
especially since typing is laborious and subtyping seems to be even more laborious 
or impossible by means of these methods. In this respect, it can be noted that Lee 
et al. (1992) attempt to distinguish between the HCV isolates HCV 324 and HCV 
324X by means of PCR fragments from the 5' UR of the genomes of these isolates. 
The results demonstrate that these 5' UR probes do not show a specific reactivity 
with the genome of the respective isolate from which they were derived.

Consequently, the aim of the present invention is to provide a method for
the rapid and indisputable determination of the presence of one or several 
genotypes of HCV present in a biological sample and indisputably classifying the 
determined isolate(s). 

Another aim of the invention is to provide a process for identifying yet 
unknown HCV types or subtypes. 

Another aim of the invention is to provide a process enabling the 
classification of infected biological fluids into different serological groups 
unambiguously linked to types and subtypes at the genomic level.

Another aim of the invention is to provide a kit for rapid detection of the 
presence or absence of different types or subtypes of HCV.

The invention relates to the use of at least one probe, with said probe 
being (i) capable of hybridizing to a genotype specific target region, present in an 
analyte strand, in the domain extending from the nucleotides at positions -291 to 
-66 of the 5' untranslated region (UR) of one of the HCV isolates, or with said 
probe being (ii) complementary to any of the above-defined probes, for genotyping 
HCV isolates present in a biological sample. 

The invention relates to the use of at least one probe preferably containing 
from about 5 to about 50 nucleotides, more preferably from about 10 to about 40 
nucleotides, and most preferably containing from about 15 to about 30 nucleotides, 
with said probe being (i) capable of hybridizing to a genotype specific
target region present in an analyte strand in the domain extending from the 
nucleotides at positions -291 to -66 of the 5' UR of one of the HCV isolates 
represented by their cDNA sequences, for example represented by their cDNA 
sequences in Figure 2, with said negative numbering of the nucleotide positions 
starting at the nucleotide preceding the first ATG codon of the open reading frame 
encoding the HCV polyprotein, or with said probe being (ii) complementary to the 
above-defined probes, for (in vitro) genotyping HCV isolates present in a
biological sample, with said sample being possibly previously identified as being 
HCV positive. 
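
The negative numbering convention described above can be illustrated with a minimal sketch. The function names and the toy sequence are hypothetical and serve only as an illustration; the single rule taken from the text is that position -1 is the nucleotide immediately preceding the first ATG codon of the open reading frame encoding the polyprotein.

```python
def utr_position(index: int, atg_index: int) -> int:
    """Convert a 0-based index in a cDNA string to the negative 5' UR numbering.

    atg_index is the 0-based index of the 'A' of the ATG that starts the
    polyprotein ORF. The nucleotide just before it is position -1.
    """
    if index >= atg_index:
        raise ValueError("index lies inside the open reading frame")
    return index - atg_index


def utr_slice(cdna: str, atg_index: int, start: int, end: int) -> str:
    """Return the nucleotides from negative position start to end, inclusive."""
    if not (start <= end < 0):
        raise ValueError("expected start <= end < 0")
    lo = atg_index + start
    hi = atg_index + end + 1
    if lo < 0:
        raise ValueError("sequence does not extend that far upstream")
    return cdna[lo:hi]


# Toy example (not an HCV sequence): 10 upstream nucleotides followed by ATG.
toy = "CCGGAATTCC" + "ATGAGC"
atg = 10  # index of the ORF start codon, supplied by the caller
```

With a real isolate, `utr_slice(cdna, atg, -291, -66)` would return the domain in which the genotype specific target regions are located. The start codon index has to be supplied explicitly, because the first ATG of the polyprotein ORF need not be the first ATG triplet in the sequence.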

The above mentioned process may be used for classifying said isolate 
according to the percentage of homology with other HCV isolates, according to 
the fact that isolates belonging to the same type: 

exhibit homology of more than 74% at the nucleic acid level in the 
complete genome; 

or exhibit homology of more than 74% at the nucleic acid level in the 
NS5 region between nucleotide positions 7935 and 8274; 

or of which the complete polyprotein shows more than 78% homology at
the amino acid level;

or of which the NS5 region between amino acids at positions 2646 and
2758 shows more than 80% homology at the amino acid level;

and according to the fact that HCV isolates belonging to the same 
subtype exhibit homology of more than 90% at the nucleic acid level in the 
complete genome and of more than 90% at the amino acid level in the complete 
polyprotein. 
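
The homology criteria above can be sketched as a simple decision rule. This is only an illustration under stated assumptions: the function names are hypothetical, homology is simplified to percent identity over pre-aligned sequences of equal length, and only the complete-genome and complete-polyprotein criteria are used.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of identical positions between two pre-aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def classify_by_homology(genome_nt: float, polyprotein_aa: float) -> str:
    """Relate two isolates using the thresholds quoted in the text.

    genome_nt:      % homology at the nucleic acid level, complete genome
    polyprotein_aa: % homology at the amino acid level, complete polyprotein
    """
    # Same subtype: more than 90% at both the nucleic acid and amino acid level.
    if genome_nt > 90.0 and polyprotein_aa > 90.0:
        return "same subtype"
    # Same type: more than 74% (nucleic acid) or more than 78% (amino acid).
    if genome_nt > 74.0 or polyprotein_aa > 78.0:
        return "same type, different subtype"
    return "different type"
```

For instance, the values of 67% (nucleic acid) and 72% (amino acid) quoted earlier for K2 versus K1 and PT-1 fall below both type thresholds, whereas values of about 80% fall between the type and subtype thresholds.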

More preferably the above mentioned process relates to the classification 
of HCV isolates according to the fact that, 

(1) based on phylogenetic analysis of nucleic acid sequences in the NS5b 
region between nucleotides 7935 and 8274 (Choo et al., 1991) or 8261 and 8600 
(Kato et al., 1990) or 8342 and 8681 (Okamoto et al., 1991), isolates belonging to
the same HCV type show nucleotide distances of less than 0.34, usually less than 
0.33, and more usually of less than 0.32, and isolates belonging to the same 
subtype show nucleotide distances of less than 0.135, usually of less than 0.13, 
and more usually of less than 0.125, and consequently isolates belonging to the 
same type but different subtypes show nucleotide distances ranging from 0.135 to 
0.34, usually ranging from 0.14 to 0.33, and more usually ranging from 0.15 to 
0.32, and isolates belonging to different HCV types show nucleotide distances 
greater than 0.34, usually greater than 0.35, and more usually of greater than 0.36,

(2) based on phylogenetic analysis of nucleic acid sequences in the 
core/E1 region between nucleotides 378 and 957, isolates belonging to the same
HCV type show nucleotide distances of less than 0.38, usually of less than 0.37, 
and more usually of less than 0.36, and isolates belonging to the same subtype 
show nucleotide distances of less than 0.17, usually of less than 0.16, and more 
usually of less than 0.15, and consequently isolates belonging to the same type but 
different subtypes show nucleotide distances ranging from 0.15 to 0.38, usually 
ranging from 0.16 to 0.37, and more usually ranging from 0.17 to 0.36, and
isolates belonging to different HCV types show nucleotide distances greater than 
0.36, usually more than 0.365, and more usually of greater than 0.37, 

(3) based on phylogenetic analysis of nucleic acid sequences in the 
NS3/NS4 region between nucleotides 4664 and 5292 (Choo et al., 1991) or 
between nucleotides 4993 and 5621 (Kato et al., 1990) or between nucleotides
5017 and 5645 (Okamoto et al., 1991), isolates belonging to the same HCV type
show nucleotide distances of less than 0.35, usually of less than 0.34, and more
usually of less than 0.33, and isolates belonging to the same subtype show
nucleotide distances of less than 0.19, usually of less than 0.18, and more usually
of less than 0.17, and consequently isolates belonging to the same type but
different subtypes show nucleotide distances ranging from 0.17 to 0.35, usually
ranging from 0.18 to 0.34, and more usually ranging from 0.19 to 0.33, and
isolates belonging to different HCV types show nucleotide distances greater than
0.33, usually greater than 0.34, and more usually of greater than 0.35.
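
The distance criteria (1) to (3) can be summarized in a small lookup. The sketch below is illustrative only: the names are hypothetical, only the broadest "less than" limit stated for each region is used, the nucleotide distance is assumed to have been computed beforehand by a phylogenetic method, and a distance lying exactly on a limit is assigned to the more distant category as a simplifying assumption.

```python
# (same-subtype limit, same-type limit) per region, taken from the
# broadest "less than" values in criteria (1) to (3).
THRESHOLDS = {
    "NS5b": (0.135, 0.34),
    "core/E1": (0.17, 0.38),
    "NS3/NS4": (0.19, 0.35),
}


def classify_by_distance(region: str, distance: float) -> str:
    """Relate two isolates from a precomputed nucleotide distance."""
    if distance < 0:
        raise ValueError("distance must be non-negative")
    subtype_limit, type_limit = THRESHOLDS[region]
    if distance < subtype_limit:
        return "same subtype"
    if distance < type_limit:
        return "same type, different subtype"
    return "different type"
```

Because the ranges quoted in the text overlap slightly near the limits, distances close to a threshold are best treated as borderline and checked against a second region.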

The term "genotyping" refers to typing and/or subtyping. A
method for 'genotyping' HCV isolates is considered to, at least partly, classify
HCV isolates into genotypes. An HCV 'genotype' is a group of HCV isolates with
related sequences. Said related sequences are defined as showing nucleotide
distances as indicated above and as illustrated in Example 9. Both larger groups
(HCV types) and smaller groups (HCV subtypes) have been shown to be related.
An HCV type always includes one or more HCV subtypes. Consequently, a method
for genotyping can aim at typing (classification into HCV types) of HCV isolates 
without the need for subtyping (classification into HCV subtypes), or, in a 
preferred embodiment, subtyping can be aimed at. It should be understood that 
classification into subtypes inherently yields data for classification into types. 

The expression "genotype specific target region" refers to at least one
nucleotide variation observed between different HCV genotypes in the 5'
untranslated region (UR) as can be readily deduced from Figures 2 and 4.

The term "HCV polyprotein" refers to the HCV polyprotein of the HCV-J
isolate (Kato et al., 1990), which belongs to subtype 1b.

The expression "probe" corresponds to any polynucleotide which forms a 
hybrid with a target sequence present in a certain HCV isolate on the basis of 
complementarity. Such a probe may be composed of DNA, RNA, or synthetic 
nucleotide analogs. The probes of the invention can be incubated with an analyte 
strand immobilized to a solid substrate. In a preferred embodiment of the 
invention, the probes themselves can be immobilized to a solid substrate. These 
probes may further include capture probes, characterized as being coupled to a
binding molecule which in turn is directly or indirectly bound to a solid substrate,
or may also include label probes, characterized in that they carry a detectable label. 

The invention relates to a process for genotyping HCV isolates present in 
a biological sample, comprising the steps of:

- contacting said sample in which the ribonucleotides or 
deoxyribonucleotides have been made accessible, if need be, under suitable 
denaturation, with at least one probe, with said probe being (i) capable of 
hybridizing to a region in the domain extending from nucleotides at positions -291 
to -66 of the 5' untranslated region of one of the HCV isolates, or with said probe 
being (ii) complementary to any of the above-defined probes, and, 

- detecting the complexes possibly formed between said 
probe and the nucleotide sequence of the HCV isolate to be identified. 

The invention relates also to a process for genotyping an HCV
isolate present in a biological sample, comprising the steps of:

- contacting said sample in which the ribonucleotides or deoxyribonucleotides
have been made accessible, if need be, under suitable denaturation, with at least
one probe of preferably from about 5 to about 50 nucleotides, more preferably from
about 10 to about 40 nucleotides, and most preferably from about 15 to about 30
nucleotides, with said probe being (i) capable of hybridizing to a region in the
domain extending from nucleotides at positions -291 to -66 of the 5' UR of one of
the HCV isolates represented by their cDNA sequences, for example represented by
their cDNA sequences in Figure 2, with said negative numbering of position
starting at the nucleotide preceding the first ATG codon of the open reading frame
encoding the HCV polyprotein, or with said probe being (ii) complementary to the
above-defined probes,

- detecting the complexes possibly formed between said probe and the nucleotide 
sequence of the HCV isolate to be identified, and, inferring the type(s) of HCV 
isolates present from the hybridization pattern. 

The above mentioned method can be considered as a method for 
classifying said isolate according to the percentage of homology with other HCV 
isolates, according to the fact that isolates belonging to the same type: 
exhibit homology of more than 74% at the nucleic acid level in the complete 
genome, 

or exhibit homology of more than 74% at the nucleic acid level in the NS5 region 
between nucleotide positions 7935 and 8274, 

or of which the complete polyprotein shows more than 78% homology at the 
amino acid level, 

or of which the NS5 region between amino acids at positions 2646 and 2758 
shows more than 80% homology at the amino acid level, 

and according to the fact that HCV isolates belonging to the same subtype exhibit 
homology of more than 90% at the nucleic acid level in the complete genome and 
of more than 90% at the amino acid level in the complete polyprotein. 

More preferably, said method relates to the classification of HCV isolates 
according to the fact that, 

(1) based on phylogenetic analysis of nucleic acid sequences in the NS5b 
region between nucleotides 7935 and 8274 (Choo et al., 1991) or 8261 and 8600 
(Kato et al., 1990) or 8342 and 8681 (Okamoto et al., 1991), isolates belonging to
the same HCV type show nucleotide distances of less than 0.34, usually less than
0.33, and more usually of less than 0.32, and isolates belonging to the same
subtype show nucleotide distances of less than 0.135, usually of less than 0.13, 
and more usually of less than 0.125, and consequently isolates belonging to the 
same type but different subtypes show nucleotide distances ranging from 0.135 to 
0.34, usually ranging from 0.14 to 0.33, and more usually ranging from 0.15 to
0.32, and isolates belonging to different HCV types show nucleotide distances
greater than 0.34, usually greater than 0.35, and more usually of greater than 0.36, 

(2) based on phylogenetic analysis of nucleic acid sequences in the 
core/E1 region between nucleotides 378 and 957, isolates belonging to the same
HCV type show nucleotide distances of less than 0.38, usually of less than 0.37,
and more usually of less than 0.36, and isolates belonging to the same subtype
show nucleotide distances of less than 0.17, usually of less than 0.16, and more
usually of less than 0.15, and consequently isolates belonging to the same type but
different subtypes show nucleotide distances ranging from 0.15 to 0.38, usually
ranging from 0.16 to 0.37, and more usually ranging from 0.17 to 0.36, and
isolates belonging to different HCV types show nucleotide distances greater than 
0.36, usually more than 0.365, and more usually of greater than 0.37, 

(3) based on phylogenetic analysis of nucleic acid sequences in the 
NS3/NS4 region between nucleotides 4664 and 5292 (Choo et al., 1991) or
between nucleotides 4993 and 5621 (Kato et al., 1990) or between nucleotides 
5017 and 5645 (Okamoto et al., 1991), isolates belonging to the same HCV type 
show nucleotide distances of less than 0.35, usually of less than 0.34, and more 
usually of less than 0.33, and isolates belonging to the same subtype show 
nucleotide distances of less than 0.19, usually of less than 0.18, and more usually
of less than 0.17, and consequently isolates belonging to the same type but 
different subtypes show nucleotide distances ranging from 0.17 to 0.35, usually 
ranging from 0.18 to 0.34, and more usually ranging from 0.19 to 0.33, and 
isolates belonging to different HCV types show nucleotide distances greater than 
0.33, usually greater than 0.34, and more usually of greater than 0.35. 

The term "analyte strand" corresponds to a single- or double-stranded 
nucleic acid molecule which is suspected to contain sequences which may be 
present in a biological sample, with said analyte strand being directly detected or 
detected after amplification. This analyte strand is preferentially positive- or 
negative-stranded RNA, cDNA, or amplified cDNA. 

The expression "biological sample" may refer to any biological sample 
(tissue or fluid) containing HCV sequences and refers more particularly to blood 
serum or plasma samples. 

The detection of hybrids formed between the type- or subtype-specific 
target region, if present, and the probes as mentioned above depends on the nature 
of the reporter molecule used (either present on the probe or on the analyte strand 
to be targeted) and may be determined by means of colorimetric, fluorescent, 
radiometric detection or any other method comprised in the state of the art. 

The term "(HCV) isolates" refers to any biological fluid containing
hepatitis C virus genetic material obtained from naturally infected humans or
experimentally infected animals, and also refers to fluids containing hepatitis C
virus genetic material which has been obtained from in vitro experiments. For
instance, from in vitro cultivation experiments, both cells and growth medium can
be employed as a source of HCV genomic material.

The expression "hybridize" or "target" refers to a hybridization 
experiment carried out according to any method known in the art, and allowing the 
detection of homologous targets (including one or few mismatches) or preferably 
completely homologous targets (no mismatches allowed). 
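
The distinction between targets with one or a few mismatches and completely homologous targets can be pictured with a string-matching sketch. This is a deliberate simplification and not a model of hybridization chemistry: actual discrimination depends on the hybridization and wash conditions. The function name is hypothetical, and the comparison is made directly against the target sequence rather than against the complementary probe.

```python
from typing import List


def find_targets(target: str, sequence: str, max_mismatches: int = 0) -> List[int]:
    """Return 0-based start positions where target occurs in sequence,
    tolerating at most max_mismatches substitutions (no gaps)."""
    n = len(target)
    hits = []
    for i in range(len(sequence) - n + 1):
        window = sequence[i:i + n]
        mismatches = sum(a != b for a, b in zip(target, window))
        if mismatches <= max_mismatches:
            hits.append(i)
    return hits


# Target region AAT TGC CAG GAC GAC C, embedded in toy flanking bases.
target = "AATTGCCAGGACGACC"
exact = "GG" + "AATTGCCAGGACGACC" + "TT"
variant = "GG" + "AATTGCCAGGATGACC" + "TT"  # single C-to-T difference
```

Setting `max_mismatches=0` corresponds to the preferred case in which no mismatches are allowed; with that setting the variant sequence above is not detected, while allowing one mismatch detects it.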

In the present invention, a sensitive PCR protocol has been used for the 
highly conserved 5' UR with sets of nested, universal primers. Positions and 
sequences of these primers were derived from the sequences of previously 
reported type 1 and 2 sequences, and the type 3 sequence BR56 (Fig. 2). The 
obtained amplification product was hybridized to oligonucleotides directed against 
the variable regions of the 5' UR, immobilized as parallel lines on membrane strips 
(reverse-hybridization principle). This hybridization assay, called line probe assay 
(LiPA), is a rapid assay, by means of which previously poorly described isolates 
similar to Z4, Z6, and Z7 (Bukh et al., 1992) were detected. A new type 4
classification is proposed for these strains of HCV. Other isolates similar to BE95
and BE96, and to SA1 (Cha et al., 1992) can be distinguished and it is proposed to
classify such isolates as type 5a. Isolates similar to HK2 (Bukh et al., 1992) can be 
distinguished and a new type 6a classification is proposed. A new genotype was 
detected in isolate BE98, and it is proposed to classify this isolate into HCV type 
3, subtype 3c. Another new sequence was detected in GB438, which could be 
classified as 4f. This LiPA technology allows an easy and fast determination of 
HCV types and their subtypes present in patient serum. 

According to a preferred embodiment of the invention, a set of probes 
comprising at least two probes is used. 

According to a preferred embodiment, in the process of the invention the 
probe used targets a region of at least 5 nucleotides in one of the following 
domains: 



a) the one extending from nucleotide at position -293 to
nucleotide at position -278 in Figure 2,

b) the one extending from nucleotide at position -275 to
nucleotide at position -260 in Figure 2,

c) the one extending from nucleotide at position -253 to
nucleotide at position -238 in Figure 2,

d) the one extending from nucleotide at position -244 to
nucleotide at position -229 in Figure 2,

e) the one extending from nucleotide at position -238 to
nucleotide at position -223 in Figure 2,

f) the one extending from nucleotide at position -170 to
nucleotide at position -155 in Figure 2,

g) the one extending from nucleotide at position -141 to
nucleotide at position -117 in Figure 2,

h) the one extending from nucleotide at position -83 to
nucleotide at position -68 in Figure 2,

i) the one extending from nucleotide at position -103 to
nucleotide at position -88 in Figure 2,

j) the one extending from nucleotide at position -146 to
nucleotide at position -130.
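
Whether a given probe satisfies the requirement of targeting at least 5 nucleotides in one of domains a) to j) can be checked with simple interval arithmetic. The sketch below is a hypothetical helper, and it assumes that "targets a region of at least 5 nucleotides" may be read as an overlap of at least 5 positions between the probe's target interval and the domain.

```python
from typing import Dict, List, Tuple

# Domains a) to j) as (start, end) in the negative 5' UR numbering, inclusive.
DOMAINS: Dict[str, Tuple[int, int]] = {
    "a": (-293, -278), "b": (-275, -260), "c": (-253, -238),
    "d": (-244, -229), "e": (-238, -223), "f": (-170, -155),
    "g": (-141, -117), "h": (-83, -68), "i": (-103, -88),
    "j": (-146, -130),
}


def overlap(a: Tuple[int, int], b: Tuple[int, int]) -> int:
    """Number of positions shared by two inclusive intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


def domains_targeted(region: Tuple[int, int], min_overlap: int = 5) -> List[str]:
    """Names of the domains that a probe target interval overlaps sufficiently."""
    return [name for name, dom in DOMAINS.items()
            if overlap(region, dom) >= min_overlap]
```

Several of the domains overlap one another (for example g and j), so a single probe can fall within more than one of them.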

Regions -170 to -155 and -141 to -117 represent variable regions in the 
linear sequence which may be part of the same stem in the viral RNA. 
Consequently mutations in one region may be complemented by another mutation 
in another region to allow or disallow RNA duplex formation. Variation is 
expected to occur at the same positions in other new types of HCV as well and, 
therefore, these variable regions might remain instrumental for the discrimination 
between all current and yet-to-be discovered types of HCV. 

According to yet another embodiment the present invention relates to a 
probe comprising a sequence such that it targets at least one of the following 
sequences: 

AAT TGC CAG GAC GAC C (SEQ ID NO 5) 
TCT CCA GGC ATT GAG C (SEQ ID NO 6) 
CCG CGA GAC TGC TAG C (SEQ ID NO 7) 
TAG CGT TGG GTT GCG A (SEQ ID NO 8) 
TTR CCG GRA AGA CTG G (SEQ ID NO 9) 
TGR CCG GGC ATA GAG T (SEQ ID NO 10) 
TTA CCG GGA AGA CTG G (SEQ ID NO 11) 
TGA CCG GAC ATA GAG T (SEQ ID NO 12) 
AAT CGC TGG GGT GAC C (SEQ ID NO 13) 
TTT CTG GGT ATT GAG C (SEQ ID NO 14) 
TCT TGG AGC AAC CCG C (SEQ ID NO 15) 
TCT TGG AAC AAC CCG C (SEQ ID NO 16) 
AAT YGC CGG GAT GAC C (SEQ ID NO 17) 
TTC TTG GAA CTA ACC C (SEQ ID NO 18) 
TTT CCG GGC ATT GAG C (SEQ ID NO 19) 
TTG GGC GYG CCC CCG C (SEQ ID NO 20) 
CCG CGA GAT CAC TAG C (SEQ ID NO 21) 
CCG GGA AGA CTG GGT C (SEQ ID NO 22) 
CCG GAA AGA CTG GGT C (SEQ ID NO 23) 
ACC CAC TCT ATG CCC G (SEQ ID NO 24) 
ACC CAC TCT ATG TCC G (SEQ ID NO 25) 
ATA GAG TGG GTT TAT C (SEQ ID NO 26) 
TCT GCG GAA CCG GTG A (SEQ ID NO 27) 
AAT TGC CAG GAY GAC C (SEQ ID NO 28) 
GCT CAG TGC CTG GAG A (SEQ ID NO 29) 
CCG CGA GAC YGC TAG C (SEQ ID NO 30) 
CCC CGC AAG ACT GCT A (SEQ ID NO 31) 
CGT ACA GCC TCC AGG C (SEQ ID NO 32) 
GGA CCC AGT CTT CCT G (SEQ ID NO 33) 
TGC CTG GTC ATT TGG G (SEQ ID NO 34) 
TKT CTG GGT ATT GAG C (SEQ ID NO 35) 
CCG CAA GAT CAC TAG C (SEQ ID NO 36) 
GAG TGT TGT ACA GCC T (SEQ ID NO 37) 
AAT CGC CGG GAT GAC C (SEQ ID NO 38) 
GAG TGT TGT GCA GCC T (SEQ ID NO 39) 
AAT CGC CGG GAC GAC C (SEQ ID NO 40) 
AAT GCC CGG CAA TTT G (SEQ ID NO 41) 
AAT CGC CGA GAT GAC C (SEQ ID NO 42) 
AAT GCT CGG AAA TTT G (SEQ ID NO 43) 
GAG TGT CGA ACA GCC T (SEQ ID NO 44) 
AAT TGC CGG GAT GAC C (SEQ ID NO 45) 
TCT CCG GGC ATT GAG C (SEQ ID NO 46) 
AAT TGC CGG GAC GAC C (SEQ ID NO 47) 
GGG TCC TTT CCA TTG G (SEQ ID NO 48) 
AAT CGC CAG GAT GAC C (SEQ ID NO 49) 
TGC CTG GAA ATT TGG G (SEQ ID NO 50) 
GAG TGT CGT ACA GCC T (SEQ ID NO 51) 
AGT YCA CCG GAA TCG C (SEQ ID NO 52) 
GGA ATG GCC AGG ACG A (SEQ ID NO 53) 
GAA TCG CCG GGT TGA C (SEQ ID NO 54) 
GAG TGT TGT ACA GCC TCC (SEQ ID NO 93) 
TGC CCG GAA ATT TGG GC (SEQ ID NO 94) 
TGC CCG GAG ATT TGG G (SEQ ID NO 95) 
GAG TGT CGA ACA GCC TC (SEQ ID NO 96) 



wherein Y represents T or C 
K represents G or T 
and R represents G or A 

- or the corresponding sequence wherein T has been replaced by U, 

- or the sequences which are complementary to the above-defined sequences. 
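
The degenerate symbols and the derived sequence forms listed above can be handled programmatically. The following sketch uses hypothetical helper names; it expands Y, K, and R as defined in the text, converts T to U, and returns the complementary strand written in the 5' to 3' direction (the reverse complement), which is an assumption about how the complementary sequence would be written out.

```python
from itertools import product
from typing import List

DEGENERATE = {"Y": "TC", "K": "GT", "R": "GA"}  # as defined in the text
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def normalize(seq: str) -> str:
    """Remove the triplet spacing used in the listing."""
    return seq.replace(" ", "").upper()


def expand(seq: str) -> List[str]:
    """All unambiguous sequences encoded by a degenerate sequence."""
    choices = [DEGENERATE.get(base, base) for base in normalize(seq)]
    return ["".join(p) for p in product(*choices)]


def to_rna(seq: str) -> str:
    """Corresponding sequence wherein T has been replaced by U."""
    return normalize(seq).replace("T", "U")


def reverse_complement(seq: str) -> str:
    """Complementary strand of an unambiguous DNA sequence, 5' to 3'."""
    return "".join(COMPLEMENT[b] for b in reversed(normalize(seq)))
```

As an example, expanding SEQ ID NO 9 (TTR CCG GRA AGA CTG G) gives four sequences, one of which is SEQ ID NO 11 (TTA CCG GGA AGA CTG G).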

According to another advantageous embodiment of the invention, at least 
two of the above-mentioned probes or a mixture of two of these probes is used to 
discriminate between various HCV types or subtypes as defined below. 

According to a preferred embodiment of the process of the invention, for 
each type or subtype of HCV to be determined, a set of two different probes or a 
mixture of two different probes is used, with each probe of the set or of the 
mixture respectively targeting a different region chosen among the regions as 
defined above, and more particularly wherein the two probes, in said set or in said 
mixture, consist of 10 to 40 contiguous nucleotides respectively targeting two 
regions respectively chosen from among the following pairs of domains: 

* the one extending from nucleotide at position -170 to 
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at 
position -141 to nucleotide at position -117 in Figure 2, 

* the one extending from nucleotide at position -170 to 
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at 
position -103 to nucleotide at position -88 in Figure 2, 

* the one extending from nucleotide at position -141 to 
nucleotide at position -117 in Figure 2 and the one extending from nucleotide at 
position -103 to nucleotide at position -88 in Figure 2, 

* the one extending from nucleotide at position -170 to
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at
position -83 to nucleotide at position -68 in Figure 2,

* the one extending from nucleotide at position -141 to
nucleotide at position -117 in Figure 2 and the one extending from nucleotide at
position -83 to nucleotide at position -68 in Figure 2,

* the one extending from nucleotide at position -170 to 
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at 
position -146 to nucleotide at position -130 in Figure 2, 

* the one extending from nucleotide at position -132 to
nucleotide at position -117 in Figure 2 and the one extending from nucleotide at
position -146 to nucleotide at position -130 in Figure 2,

* the one extending from nucleotide at position -146 to
nucleotide at position -130 in Figure 2 and the one extending from nucleotide at
position -103 to nucleotide at position -88 in Figure 2.

The invention also relates to a probe having a sequence such that it
targets:

- the following sequence: TTC TTG GAA CTA ACC C, 

- or the corresponding sequence wherein T has been replaced by U, 

- or the sequences which are complementary to the above-defined 
sequences. 

The invention also relates to a set of two probes or mixtures of two 
probes wherein each of the two probes consists of 10 to 40 contiguous nucleotides, 
and wherein the two probes respectively target two regions respectively chosen 
from among the following pairs of domains: 

* the one extending from nucleotide at position -170 to 
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at 
position -141 to nucleotide at position -117 in Figure 2, 

* the one extending from nucleotide at position -170 to 
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at 
position -103 to nucleotide at position -88 in Figure 2, 

* the one extending from nucleotide at position -141 to 
nucleotide at position -117 in Figure 2 and the one extending from nucleotide at 
position -103 to nucleotide at position -88 in Figure 2, 

* the one extending from nucleotide at position -170 to 
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at 
position -83 to nucleotide at position -68 in Figure 2, 

* the one extending from nucleotide at position -141 to 
nucleotide at position -117 in Figure 2 and the one extending from nucleotide at 
position -83 to nucleotide at position -68 in Figure 2, 

* the one extending from nucleotide at position -170 to 
nucleotide at position -155 in Figure 2 and the one extending from nucleotide at 
position -146 to nucleotide at position -130 in Figure 2, 

* the one extending from nucleotide at position -132 to 
nucleotide at position -117 in Figure 2 and the one extending from nucleotide at 
position -146 to nucleotide at position -130 in Figure 2, 

* the one extending from nucleotide at position -146 to 
nucleotide at position -130 in Figure 2 and the one extending from nucleotide at 
position -103 to nucleotide at position -88 in Figure 2. 

According to a preferred embodiment, the invention relates to a process 
for typing HCV isolates as belonging to at least one of the following HCV types: 
HCV type 1, HCV type 2, HCV type 3, HCV type 4, HCV type 5, HCV type 6 
from a biological sample liable to contain it, and comprises the steps of:

- contacting said sample in which the ribonucleotides or
deoxyribonucleotides have been made accessible, if need be, under suitable
denaturation, with at least one probe being capable of hybridizing to a region in
the domain extending from nucleotide at position -291 to nucleotide at position
-66 of the 5' UR of HCV isolates represented by their cDNA sequences in Figures 2
and 4, with said negative numbering of the nucleotide position starting at the
nucleotide preceding the first ATG codon in the open reading frame encoding the
HCV polyprotein, or with said probe being complementary to the above-defined
probes;

- detecting the complexes possibly formed between said probe and
the target region, and

- inferring the HCV types present from the observed 
hybridization pattern. 

According to a preferred embodiment, the invention relates to a process 
for typing HCV isolates as belonging to at least one of the following HCV types: 
HCV type 1, HCV type 2, HCV type 3, HCV type 4, HCV type 5, and HCV type 
6, and is such that the probes used are able to target one of the following target 
regions or said regions wherein T has been replaced by U, or the regions which are 
complementary to the above-said regions: 

for HCV type 1 and 6:

AAT TGC CAG GAC GAC C (No. 5)
TCT CCA GGC ATT GAG C (No. 6)
AAT TGC CAG GAY GAC C (No. 28)

for HCV type 1:

GCT CAG TGC CTG GAG A (No. 29)

for HCV type 2:

TAG CGT TGG GTT GCG A (No. 8)
TTR CCG GRA AGA CTG G (No. 9)
TGR CCG GGC ATA GAG T (No. 10)
TTA CCG GGA AGA CTG G (No. 11)
TGA CCG GAC ATA GAG T (No. 12)
CGT ACA GCC TCC AGG C (No. 32)
CCG GGA AGA CTG GGT C (No. 22)
CCG GAA AGA CTG GGT C (No. 23)
ACC CAC TCT ATG CCC G (No. 24)
ACC CAC TCT ATG TCC G (No. 25)
ATA GAG TGG GTT TAT C (No. 26)
GGA CCC AGT CTT CCT G (No. 33)
TGC CTG GTC ATT TGG G (No. 34)

for HCV type 3:

AAT CGC TGG GGT GAC C (No. 13)
TTT CTG GGT ATT GAG C (No. 14)
CCG CGA GAT CAC TAG C (No. 21)
CCG CAA GAT CAC TAG C (No. 36)
GAA TCG CCG GGT TGA C (No. 54)

for HCV type 4 and 5:

AAT YGC CGG GAT GAC C (No. 17)

for HCV type 4:

TTC TTG GAA CTA ACC C (No. 18)

for HCV type 4, 3c and 3b:

TTT CCG GGC ATT GAG C (No. 19)

for HCV type 4 and 3b:

AAT CGC CGG GAT GAC C (No. 38)

for HCV type 4:

GAG TGT TGT ACA GCC T (No. 37)
GAG TGT TGT GCA GCC T (No. 39)
AAT CGC CGG GAC GAC C (No. 40)
AAT GCC CGG CAA TTT G (No. 41)
AAT CGC CGA GAT GAC C (No. 42)
AAT GCT CGG AAA TTT G (No. 43)
AAT CGC CAG GAT GAC C (No. 49)
TGC CTG GAA ATT TGG G (No. 50)
GGA ATC GCC AGG ACG A (No. 53)

for HCV type 5:

AAT TGC CGG GAT GAC C (No. 45)
AAT TGC CGG GAC GAC C (No. 47)
TCT CCG GGC ATT GAG C (No. 46)
GAG TGT CGA ACA GCC T (No. 44)

for HCV type 6:

GGG TCC TTT CCA TTG G (No. 48)

wherein Y represents C or T, and K represents G or T, or the probes used are a set 
of two probes chosen from among the above-defined probes. 
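
As an illustration of how the degenerate symbols and the "T replaced by U" and "complementary" variants of a target region can be enumerated, the following is a minimal sketch in Python. The function names (expand, reverse_complement, to_rna) are hypothetical helpers introduced here and are not part of the described process; only the symbols Y, K and R defined in this text are handled.

```python
from itertools import product

# Degenerate symbols as defined in the text: Y = C or T, K = G or T, R = A or G.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "Y": "CT", "K": "GT", "R": "AG"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def expand(region):
    """List every plain sequence encoded by a region written with Y, K or R."""
    seq = region.replace(" ", "")
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in seq))]


def reverse_complement(seq):
    """Complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def to_rna(seq):
    """Same region with T replaced by U."""
    return seq.replace("T", "U")


# Target region No. 17 (types 4 and 5) contains one degenerate position.
variants = expand("AAT YGC CGG GAT GAC C")
print(variants)
print(reverse_complement(variants[0]))
print(to_rna(variants[0]))
```

In this sketch the two expansions of region No. 17 coincide with regions No. 38 and No. 45 of the list above.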

The invention also relates to the use of the above-defined method 
for determining the type(s) of HCV isolates present in a biological sample. 

The term "type" corresponds to a group of HCV isolates of which 
the complete genome shows more than 74% homology at the nucleic acid level, or 
of which the NS5 region between nucleotide positions 7935 and 8274 shows more 
than 74% homology at the nucleic acid level, or of which the complete HCV
polyprotein shows more than 78% homology at the amino acid level, or of which 
the NS5 region between amino acids at positions 2646 and 2758 shows more than 
80% homology at the amino acid level, to genomes of the other isolates of the 
group, with said numbering beginning with the first ATG codon or methionine of 
the HCV polyprotein of the HCV-J isolate (Kato et al., 1990). Isolates belonging 
to different types of HCV exhibit homologies of less than 74% at the nucleic acid 
level and less than 78% at the amino acid level. Isolates belonging to the same 
type usually show homologies of about 92 to 95% at the nucleic acid level and 95 
to 96% at the amino acid level when belonging to the same subtype, and those 
belonging to the same type but different subtypes preferably show homologies of 
about 79% at the nucleic acid level and 85-86% at the amino acid level. More 
preferably, classification of HCV isolates should be performed according to the 
fact that, 

(1) based on phylogenetic analysis of nucleic acid sequences in 
the NS5b region between nucleotides 7935 and 8274 (Choo et al., 1991) or 8261
and 8600 (Kato et al., 1990) or 8342 and 8681 (Okamoto et al., 1991), isolates
belonging to the same HCV type show nucleotide distances of less than 0.34,
usually less than 0.33, and more usually of less than 0.32, and isolates belonging 
to the same subtype show nucleotide distances of less than 0.135, usually of less 
than 0.13, and more usually of less than 0.125, and consequently isolates 
belonging to the same type but different subtypes show nucleotide distances 
ranging from 0.135 to 0.34, usually ranging from 0.14 to 0.33, and more usually
ranging from 0.15 to 0.32, and isolates belonging to different HCV types show 
nucleotide distances greater than 0.34, usually greater than 0.35, and more usually 
of greater than 0.36, 

(2) based on phylogenetic analysis of nucleic acid sequences in 
the core/E1 region between nucleotides 378 and 957, isolates belonging to the
same HCV type show nucleotide distances of less than 0.38, usually of less than 
0.37, and more usually of less than 0.36, and isolates belonging to the same 
subtype show nucleotide distances of less than 0.17, usually of less than 0.16, and 
more usually of less than 0.15, and consequently isolates belonging to the same 
type but different subtypes show nucleotide distances ranging from 0.15 to 0.38, 
usually ranging from 0.16 to 0.37, and more usually ranging from 0.17 to 0.36, and 
isolates belonging to different HCV types show nucleotide distances greater than 
0.36, usually more than 0.365, and more usually of greater than 0.37,

(3) based on phylogenetic analysis of nucleic acid sequences in
the NS3/NS4 region between nucleotides 4664 and 5292 (Choo et al., 1991) or
between nucleotides 4993 and 5621 (Kato et al., 1990) or between nucleotides
5017 and 5645 (Okamoto et al., 1991), isolates belonging to the same HCV type
show nucleotide distances of less than 0.35, usually of less than 0.34, and more 
usually of less than 0.33, and isolates belonging to the same subtype show 
nucleotide distances of less than 0.19, usually of less than 0.18, and more usually 
of less than 0.17, and consequently isolates belonging to the same type but 
different subtypes show nucleotide distances ranging from 0.17 to 0.35, usually 
ranging from 0.18 to 0.34, and more usually ranging from 0.19 to 0.33, and 
isolates belonging to different HCV types show nucleotide distances greater than 
0.33, usually greater than 0.34, and more usually of greater than 0.35. 
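
By way of illustration only, criterion (1) can be read as a two-threshold rule on a pairwise nucleotide distance in the NS5b region. The sketch below uses the outermost boundary values quoted above (0.135 and 0.34); the function name and the handling of a distance exactly equal to 0.34 are assumptions made for this example, and the calculation of the distance itself is outside its scope.

```python
# Outermost boundary values quoted in criterion (1) for the NS5b region.
SUBTYPE_MAX = 0.135  # below this distance: same subtype
TYPE_MAX = 0.34      # above this distance: different types


def classify_ns5b_distance(distance):
    """Relate two isolates from their NS5b nucleotide distance (hypothetical helper)."""
    if distance < SUBTYPE_MAX:
        return "same subtype"
    if distance <= TYPE_MAX:  # assumption: the boundary value counts as same type
        return "same type, different subtype"
    return "different type"


for d in (0.05, 0.20, 0.40):
    print(d, classify_ns5b_distance(d))
```

The same pattern would apply to criteria (2) and (3) with the boundary values quoted for the core/E1 and NS3/NS4 regions.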

According to a preferred embodiment of this invention, any of the
probes designated with SEQ ID NO 5, 28 and 6 may be used to identify type 1;
any of the probes with SEQ ID NO 8 to 12, 22 to 26 and 32 to 34 may be used to
identify type 2; any of the probes with SEQ ID NO 13, 14, 36, 21, or 54 may be
used to identify type 3; and any of the probes with SEQ ID NO 17, 18, 19, 37 to 43,
49, 50 and 53 may be used to identify type 4.

Probes 44 to 47 may be used to identify type 5, and probe 48 may be
used to identify type 6.
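
The assignment of probes to types given above can be sketched as a simple lookup. The Python fragment below is a minimal illustration, assuming that the reactive probes of a sample are available as a set of SEQ ID NO numbers; the names TYPE_PROBES and candidate_types are hypothetical. It does not model probes shared between types (for instance No. 17) beyond the assignment stated in the paragraph above, so its output should be read as a list of candidates rather than a final typing result.

```python
# Probe numbers (SEQ ID NO) assigned to each HCV type in the paragraph above.
TYPE_PROBES = {
    1: {5, 6, 28},
    2: set(range(8, 13)) | set(range(22, 27)) | set(range(32, 35)),
    3: {13, 14, 21, 36, 54},
    4: {17, 18, 19, 49, 50, 53} | set(range(37, 44)),
    5: set(range(44, 48)),
    6: {48},
}


def candidate_types(reactive_probes):
    """Return the HCV types for which at least one assigned probe reacted."""
    reactive = set(reactive_probes)
    return sorted(t for t, probes in TYPE_PROBES.items() if probes & reactive)


print(candidate_types({8, 22}))   # probes assigned to type 2 only
print(candidate_types({5, 48}))   # pattern compatible with a mixed or ambiguous result
```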

The following regions might also be used for discrimination of
certain types: the region between positions -238 and -223 for type 2, the region
between positions -244 and -229 for type 4, and the regions between positions
-253 and -238, between positions -275 and -260, or between positions -293 and
-278 for type 3.

The nucleotide at position -2 can also be employed to further 
discriminate between certain types or subtypes. 

The process of the invention also comprises the discrimination 
and classification of subtypes of HCV, wherein besides the above-mentioned 
probes also probes hybridizing to the following target regions are used, or said 
regions wherein T is replaced by U or said regions which are complementary to 
the above-defined regions, 

for HCV type 1, subtype 1a:

CCC CGC AAG ACT GCT A (No. 31)

for HCV type 1, subtype 1b:

CCG CGA GAC TGC TAG C (No. 7)
CCG CGA GAC YGC TAG C (No. 30)

wherein Y represents C or T,



for HCV type 2, subtype 2a: 

TTR CCG GRA AGA CTG G (No. 9) 
TGR CCG GGC ATA GAG T (No. 10) 
CCG GGA AGA CTG GGT C (No. 22) 
ACC CAC TCT ATG CCC G (No. 24)

wherein R represents A or G, 

for HCV type 2, subtype 2b: 

TTA CCG GGA AGA CTG G (No. 11)
TGA CCG GAC ATA GAG T (No. 12)
CCG GAA AGA CTG GGT C (No. 23) 
ACC CAC TCT ATG TCC G (No. 25) 

for HCV type 2, subtype 2c: 

GGA CCC AGT CTT CCT G (No. 33) 
TGC CTG GTC ATT TGG G (No. 34) 

for HCV type 3, subtype 3a: 

AAT CGC TGG GGT GAC C (No. 13) 
TTT CTG GGT ATT GAG C (No. 14) 
TKT CTG GGT ATT GAG C (No. 35) 

wherein K represents G or T, 

for HCV type 3, subtype 3b: 

TTT CCG GGC ATT GAG C (No. 19) 
AAT CGC CGG GAT GAC C (No. 38) 
CCG CGA GAT CAC TAG C (No. 21) 

for HCV type 3, subtype 3c: 

GAG TGT CGT ACA GCC T (No. 51)
GAA TCG CCG GGT TGA C (No. 54) 
TTT CCG GGC ATT GAG C (No. 19) 
CCG CGA GAC TGC TAG C (No. 7) 



for HCV type 4, subtype 4a or 4d: 

AAT CGC CGG GAT GAC C (No. 38) 
TTT CCG GGC ATT GAG C (No. 19)

for type 4, subtype 4b:

AAT CGC CGG GAT GAC C (No. 38)
AAT GCC CGG CAA TTT G (No. 41) 
AAT CGC CGG GAC GAC C (No. 40) 

for type 4, subtype 4c : 

AAT CGC CGA GAT GAC C (No. 42) 
AAT GCT CGG AAA TTT G (No. 43) 
TGC CTG GAA ATT TGG G (No. 50) 
GGA ATC GCC AGG ACG A (No. 53) 
CCG CGA GAC TGC TAG C (No. 7) 

for type 4, subtype 4e : 

AAT CGC CGG GAC GAC C (No. 40) 
GAG TGT TGT GCA GCC T (No. 39) 
AAT GCC CGG CAA TTT G (No. 41) 

for type 4, subtype 4f : 

TTT CCG GGC ATT GAG C (No. 19) 
AAT CGC CGG GAT GAC C (No. 38) 
GAG TGT CGT ACA GCC T (No. 51)
CCG CGA GAC TGC TAG C (No. 7) 

for type 4, subtype 4g (provisional): 

TGC CTG GAA ATT TGG G (No. 50) 
GGA ATC GCC AGG ACG A (No. 53) 

for type 4, subtype 4h (provisional): 

AAT CGC CAG GAT GAC C (No. 49) 
TGC CTG GAA ATT TGG G (No. 50) 

or the probes used are a set of two probes chosen from among the defined
probes.

The invention also relates to the use of the above-defined method for 
determining the HCV subtype(s) present in a biological sample to be analyzed. 

The term "subtype" corresponds to a group of HCV isolates of which the
complete genome or complete polyprotein shows a homology of more than 90%
both at the nucleic acid and amino acid levels, or of which the region in NS5
between nucleotide positions 7935 and 8274 shows a homology of more than 88%
at the nucleic acid level to the corresponding parts of the genomes of the other
isolates of the same group, with said numbering beginning with the adenine
residue of the initiation codon of the long ORF. Isolates belonging to different
subtypes of HCV and belonging to the same type of HCV show homologies of
more than 74% at the nucleic acid level and of more than 78% at the amino acid
level.

More preferably, in the above-mentioned process, classification of HCV
isolates into types and subtypes should be performed according to the fact that,

(1) based on phylogenetic analysis of nucleic acid sequences in the NS5b 
region between nucleotides 7935 and 8274 (Choo et al., 1991) or 8261 and 8600
(Kato et al., 1990) or 8342 and 8681 (Okamoto et al., 1991), isolates belonging to
the same HCV type show nucleotide distances of less than 0.34, usually less than
0.33, and more usually of less than 0.32, and isolates belonging to the same
subtype show nucleotide distances of less than 0.135, usually of less than 0.13,
and more usually of less than 0.125, and consequently isolates belonging to the
same type but different subtypes show nucleotide distances ranging from 0.135 to
0.34, usually ranging from 0.14 to 0.33, and more usually ranging from 0.15 to
0.32, and isolates belonging to different HCV types show nucleotide distances
greater than 0.34, usually greater than 0.35, and more usually of greater than 0.36,

(2) based on phylogenetic analysis of nucleic acid sequences in the 
core/E1 region between nucleotides 378 and 957, isolates belonging to the same
HCV type show nucleotide distances of less than 0.38, usually of less than 0.37,
and more usually of less than 0.36, and isolates belonging to the same subtype
show nucleotide distances of less than 0.17, usually of less than 0.16, and more
usually of less than 0.15, and consequently isolates belonging to the same type but
different subtypes show nucleotide distances ranging from 0.15 to 0.38, usually
ranging from 0.16 to 0.37, and more usually ranging from 0.17 to 0.36, and 
isolates belonging to different HCV types show nucleotide distances greater than 
0.36, usually more than 0.365, and more usually of greater than 0.37, 

(3) based on phylogenetic analysis of nucleic acid sequences in the 
NS3/NS4 region between nucleotides 4664 and 5292 (Choo et al., 1991) or 
between nucleotides 4993 and 5621 (Kato et al., 1990) or between nucleotides 
5017 and 5645 (Okamoto et al., 1991), isolates belonging to the same HCV type
show nucleotide distances of less than 0.35, usually of less than 0.34, and more
usually of less than 0.33, and isolates belonging to the same subtype show
nucleotide distances of less than 0.19, usually of less than 0.18, and more usually
of less than 0.17, and consequently isolates belonging to the same type but
different subtypes show nucleotide distances ranging from 0.17 to 0.35, usually
ranging from 0.18 to 0.34, and more usually ranging from 0.19 to 0.33, and
isolates belonging to different HCV types show nucleotide distances greater than
0.33, usually greater than 0.34, and more usually of greater than 0.35.

Using these criteria, HCV isolates can be classified into at least 6 types. 

Several subtypes can clearly be distinguished in types 1, 2, 3 and 4 (1a,
1b, 2a, 2b, 2c, 3a, 3b, 3c, 4a, 4b, 4c, 4d, 4e and 4f) based on homologies of the 5'
UR and coding regions including the part of NS5 between positions 7935 and
8274 and the C/E1 region between nucleotides 317 and 957, and based on
comparisons with isolates Z1 and DK13 as described in Bukh et al. (1993).

Further subdivision of type 4 into subtypes 4g and 4h is tentative and
only based on differences in the 5' UR. An overview of most of the reported
isolates and their proposed classification according to the typing system of the
present invention is given in Table 1.

According to a preferred embodiment of the present invention, the probe
with SEQ ID NO 31 may be used to identify subtype 1a; the probes with SEQ ID
NO 7 and 30 may be used to identify subtype 1b; any of the probes with SEQ ID
NO 9, 10, 22, or 24 may be used to identify subtype 2a; any of the probes with
SEQ ID NO 11, 12, 23, or 25 may be used to identify subtype 2b; any of the
probes with SEQ ID NO 33 or 34 may be used to identify subtype 2c; any of the
probes with SEQ ID NO 13, 14, or 35 may be used to identify subtype 3a; any of
the probes with SEQ ID NO 38, 19 and 21 may be used to identify subtype 3b, 4a
or 4d; any of the probes with SEQ ID NO 38 or 41 may be used to identify subtype
4b; any of the probes with SEQ ID NO 42 or 43 may be used to identify subtype
4c; any of the probes with SEQ ID NO 39, 40, or 41 may be used to identify
subtype 4e; any of the probes with SEQ ID NO 51, 38, 19, or 7 may be used to
identify subtype 4f; any of the probes with SEQ ID NO 49 or 50 may be used to
identify the putative subtype 4h; and any of the probes with SEQ ID NO 50 or 53
may be used to identify the putative subtype 4g.
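
Because several probes are listed under more than one subtype, a single reactive line is often compatible with several subtypes. The following Python sketch, given purely as an illustration, lists for each subtype which of its assigned probes reacted. The grouping follows the paragraph above as read by the editor; SUBTYPE_PROBES and matching_subtypes are hypothetical names, and no scoring or tie-breaking rule from the invention is implied.

```python
# Probe numbers (SEQ ID NO) listed per subtype in the paragraph above.
SUBTYPE_PROBES = {
    "1a": {31},
    "1b": {7, 30},
    "2a": {9, 10, 22, 24},
    "2b": {11, 12, 23, 25},
    "2c": {33, 34},
    "3a": {13, 14, 35},
    "3b/4a/4d": {38, 19, 21},
    "4b": {38, 41},
    "4c": {42, 43},
    "4e": {39, 40, 41},
    "4f": {51, 38, 19, 7},
    "4h": {49, 50},
    "4g": {50, 53},
}


def matching_subtypes(reactive_probes):
    """Map each compatible subtype to the sorted list of its probes that reacted."""
    reactive = set(reactive_probes)
    return {
        subtype: sorted(probes & reactive)
        for subtype, probes in SUBTYPE_PROBES.items()
        if probes & reactive
    }


# Probes 38 and 41 together are fully consistent only with subtype 4b,
# although each of them is also listed under other subtypes.
print(matching_subtypes({38, 41}))
```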

According to a preferred embodiment of the process of the invention, the 
HCV types or subtypes to be discriminated are also identified by means of
universal probes for HCV, such as the ones targeting one of the following regions: 

TTG GGC GYG CCC CCG C (No. 20) 
TCT GCG GAA CCG GTG A (No. 27) 

According to another advantageous embodiment of the process of the
invention, the hybridization step is preceded by an amplification step of the
deoxyribonucleotide or ribonucleotide containing the region to target,
advantageously comprising the following steps:

- contacting the biological sample liable to contain the
isolate to be typed or subtyped with a set of primers, flanking the region to target,
with said primers being complementary to conserved regions of the HCV genome,
and preferably primers being complementary to the 5' untranslated conserved
regions of the HCV genome, with said primers preferably having at least 8
contiguous nucleotides, more preferably about 15, and even more preferably more
than 15 contiguous nucleotides, with said contiguous nucleotides being respectively
complementary to sequences chosen from the region extending from nucleotide
-341 to nucleotide -171 and from the region extending from nucleotide -67 to
nucleotide -1, of Figures 2 and 4.

Alternatively, the antisense primers could also extend into the core region,
or the set of primers may be aimed at amplifying both the 5' UR and the core
region, either in one PCR fragment or with a set of primers for each of the two
regions. Consequently, probes from the core region, able to hybridize to
(sub)type-specific regions in core PCR products, may be included in the line
probe assay to further discriminate between types and/or subtypes.

- amplifying the target region, for instance via a 
polymerase chain reaction by means of the above-mentioned set of primers and 
possibly incorporating a label such as digoxigenin or biotin into the amplified 
target sequence, with said amplifying being repeated between 20 and 80 times, 
advantageously between 30 and 50 times. 

According to a preferred embodiment of the invention, the analyte strand
may be enzymatically or chemically modified either in vivo or in vitro prior to
hybridization. Many systems for coupling reporter groups to nucleic acid
compounds have been described, based on the use of such labels as biotin or
digoxigenin. In still another embodiment of the invention sandwich hybridization
may be used. In a preferred embodiment, the target sequence present in the analyte
strand is converted into cDNA, with said cDNA being amplified by any technique
known in the art such as by the polymerase chain reaction (PCR; Saiki et al.,
1988), ligase chain reaction (LCR; Landegren et al., 1988; Wu & Wallace, 1989;
Barany, 1991), nucleic acid sequence-based amplification (NASBA; Guatelli et
al., 1990; Compton, 1991), transcription-based amplification system (TAS; Kwoh
et al., 1989), strand displacement amplification (SDA; Duck, 1990; Walker et al.,
1992) or amplification by means of Qβ replicase (Lizardi et al., 1988; Lomeli et
al., 1989).

The cDNA amplification step is preferably achieved by means of PCR 
technology and may consist of steps: 

(a) providing a set of primers for a polymerase chain reaction method 
which flank the target sequence to be detected; 

(b) amplifying the target region via a polymerase chain reaction method
by means of the primers of (a); in the same step an appropriate label molecule
can be incorporated into the amplified target, said label molecule being preferably
digoxigenin or biotin.

The term "primers" corresponds to oligonucleotide sequences being 
complementary to conserved regions of sense or antisense strands of cDNA or 
RNA derived from the HCV genome; preferably of the 5' untranslated conserved
regions of the HCV genome and more preferably selected from conserved regions 
of the 5' untranslated region of the HCV genome comprising positions -341 to 
-171 and -67 to -1, or the core region. 

In an advantageous embodiment of the invention, the process is such that 
amplification consists of a double PCR step, each step involving a specific set of 
primers, with the said first step involving outer primers selected from the region 
extending from nucleotide -341 to nucleotide -186 and from the region extending
from nucleotide -52 to nucleotide -1, and more particularly the following set: 

CCC TGT GAG GAA CTW CTG TCT TCA CGC (No. 1) 

GGT GCA CGG TCT ACG AGA CCT (No. 2) 

or their complements, 
wherein W represents A or T, and with the second step involving nested primers 
selected from the region extending from nucleotide -326 to nucleotide -171 and 
from the region extending from nucleotide -68 to nucleotide -1 and, more 
particularly the following set: 

TCT AGC CAT GGC GTT AGT RYG AGT GT (No. 3) 

CAC TCG CAA GCA CCC TAT CAG GCA GT (No. 4) 
wherein R represents A or G and Y represents T or C 
or their complements. 

According to this embodiment of the invention, a double PCR is 
performed with outer primers in the first round including sequences as shown in 
SEQ ID NO 1 and 2, or their complementary sequences and with nested primers 
for the second round including sequences as shown in SEQ ID NO 3 and 4, or 
their complementary sequences. 
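
As a small illustration of what the degenerate positions in the above primers imply, the following Python sketch computes, for each of the four primers, its length and the number of distinct sequences that the degenerate symbols W, R and Y encode. The names PRIMERS, DEGENERACY and summarize are hypothetical and serve only this example.

```python
# Number of bases encoded by each symbol used in primers No. 1 to 4
# (W = A or T, R = A or G, Y = T or C, as defined in the text).
DEGENERACY = {"A": 1, "C": 1, "G": 1, "T": 1, "W": 2, "R": 2, "Y": 2}

PRIMERS = {
    1: "CCC TGT GAG GAA CTW CTG TCT TCA CGC",  # outer primer
    2: "GGT GCA CGG TCT ACG AGA CCT",          # outer primer
    3: "TCT AGC CAT GGC GTT AGT RYG AGT GT",   # nested primer
    4: "CAC TCG CAA GCA CCC TAT CAG GCA GT",   # nested primer
}


def summarize(primer):
    """Return (length in nucleotides, number of sequences in the primer mixture)."""
    seq = primer.replace(" ", "")
    variants = 1
    for base in seq:
        variants *= DEGENERACY[base]
    return len(seq), variants


for number, primer in PRIMERS.items():
    print(number, summarize(primer))
```

All four primers exceed the preferred minimum of 15 contiguous nucleotides stated above.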

The term "appropriate label molecule" may include the use of 
labeled nucleotides incorporated during the polymerase step of the amplification 
such as illustrated in Saiki et al. (1988) and Bej et al. (1990) and/or any other
method known to the person skilled in the art. 

The assays as described in this invention may be improved in 
several ways obvious for the person skilled in the art. For example the cPCR 
reactions can be preceded by an RNA-capture step. 

According to yet another embodiment, the present invention 
relates to a composition comprising at least one oligonucleotide primer, with said 
primers preferably having at least 15 contiguous nucleotides, with said contiguous
nucleotides being respectively complementary to sequences chosen from the
region extending from nucleotide -341 to nucleotide -171 and from the region
extending from nucleotide -67 to nucleotide -1 (of Figure 2), or their complement.

According to yet another embodiment, the present invention relates 
to a composition comprising at least one oligonucleotide primer preferably having at 
least 15 contiguous nucleotides, with said contiguous nucleotides being chosen from 
any of the following sequences: 

CCC TGT GAG GAA CTW CTG TCT TCA CGC (No. 1)
GGT GCA CGG TCT ACG AGA CCT (No. 2)
TCT AGC CAT GGC GTT AGT RYG AGT GT (No. 3) 
CAC TCG CAA GCA CCC TAT CAG GCA GT (No. 4) 
wherein W represents A or T, R represents A or G, and Y represents T or C, 
or their complements. 

According to an advantageous embodiment, the process of the invention 
for the simultaneous typing of all HCV isolates contained in a biological sample 
comprises the step of contacting one of the following elements: 

- either said biological sample in which the genetic
material is made available for hybridization,

- or the purified genetic material contained in said
biological sample,

- or single copies derived from the purified genetic
material,

- or amplified copies derived from the purified genetic
material,

with a solid support on which probes as defined above have been previously
immobilized.

According to this preferred embodiment of the invention, the probes as 
defined above are immobilized to a solid substrate.

The term "solid substrate" can refer to any substrate to which an 
oligonucleotide probe can be coupled, provided that it retains its hybridization 
characteristics and provided that the background level of hybridization remains 
low. Usually the solid substrate will be a microtiter plate or a membrane (e.g.
nylon or nitrocellulose). 

Prior to application to the membrane or fixation it may be convenient to 
modify the nucleic acid probe in order to facilitate fixation or improve the 
hybridization efficiency. Such modifications may encompass homopolymer 
tailing, coupling with different reactive groups such as aliphatic groups, NH2 
groups, SH groups, carboxylic groups, or coupling with biotin or haptens.

According to an advantageous embodiment of the invention, the process
comprises the step of contacting any one of the probes as defined above with one
of the following elements: 

- either a biological sample in which the genetic material is 
made available for hybridization, 

- or the purified genetic material contained in said
biological sample,

- or a single copy derived from the purified genetic
material,

- or an amplified copy derived from the purified genetic
material, with said elements being previously immobilized on a support.

The invention also relates to the typing of new isolates. 

More particularly, the invention relates to a process for the detection and
identification of novel HCV types or subtypes different from the known types or 
subtypes and comprising the steps of: 

- determining to which known types or subtypes the HCV
isolate present in the biological sample belongs, according to the process as
defined above, possibly with said biological sample being previously determined 
as containing HCV, possibly by means of HCV antigen or antibody assays or with 
a universal probe for HCV, such as those defined above, 

- in the case of observing a sample which does not 
hybridize positively with at least one of the probes able to target the regions 
chosen from any of the two domains as defined above, sequencing the complete 
genome of the HCV type present in the sample, or alternatively sequencing that 
(the) portion(s) of the 5' untranslated region of the sample corresponding to a new 
type and/or subtype to be determined. 

Advantageously, the process for the detection and identification of novel
HCV types and/or subtypes, present in a biological sample, which are different
from type 1, type 2, type 3, type 4, type 5, type 6, in the case of identifying a novel
type; and which are different from subtypes 1a and 1b for a type 1 HCV isolate,
from subtypes 2a, 2b, and 2c for a type 2 isolate, from subtypes 3a, 3b and 3c for a
type 3 isolate, from subtypes 4a, 4b, 4c, 4d, 4e, 4f, 4g and 4h for a type 4 isolate,
from subtype 5a for a type 5 isolate, and from subtype 6a for a type 6 isolate, in the
case of identifying a novel subtype, comprises the steps of:

- determining to which known type(s) or subtype(s) the 
HCV isolate(s) present in the biological sample to be analyzed belongs, according 
to the process of the invention, possibly with said biological sample being 
previously determined as containing HCV, possibly by means of HCV antigen or 
antibody assays or with a universal probe for HCV such as the one defined above,

- in the case of observing a sample which does not
hybridize to at least one of the probes able to target the regions chosen from any of
the type specific or subtype specific domains as defined above, more particularly
not hybridizing with SEQ ID NO 5, 28 and 6 for type 1, with SEQ ID NO 8 to 12 
or 22 to 26 and 32 to 34 for type 2, with SEQ ID NO 13, 14, 36, 21 or 54 for type 
3, and with SEQ ID NO 17, 18, 19, 37 to 43, 49, 50 and 53 for type 4; and with 
SEQ ID NO 7 and 30 for subtype 1b, with SEQ ID NO 31 for subtype 1a, with
SEQ ID NO 9, 10, 22 or 24 for subtype 2a, with SEQ ID NO 11, 12, 23 or 25 for
subtype 2b, with SEQ ID NO 33 or 34 for subtype 2c, with SEQ ID NO 13, 14 or 
35 for subtype 3a, with SEQ ID NO 38, 21 and 19 for subtype 3b, 4a or 4d, with 
SEQ ID NO 38 or 41 for subtype 4b; with SEQ ID NO 42 or 43 for subtype 4c; 
with SEQ ID NO 39, 40, or 41 for subtype 4e; with SEQ ID NO 51, 38, 19 or 7
for subtype 4f; with SEQ ID NO 49 or 50 for the putative subtype 4h; with SEQ
ID NO 50 or 53 for the putative subtype 4g, sequencing the complete genome of 
the HCV type present in the sample, or, alternatively sequencing that (the) 
portion(s) of the 5' untranslated region of the sample corresponding to a new type 
and/or subtype to be determined. 

The term "new isolates" corresponds to isolates which are not able to 
hybridize to any of the 9 above-mentioned regions or show reactivities which
cannot be correctly interpreted as matching one of the currently known HCV types
or subtypes. This special embodiment of the invention may also be performed by 
the steps of: 

(a) screening HCV antibody-positive sera, or clinical NANB hepatitis 
samples, or a population of random samples, by cPCR (cDNA PCR), 

(b) performing a HCV LiPA with those samples from which a cPCR 
product has been obtained, and 

(c) cloning and sequencing these PCR fragments showing aberrant 
reactivities. 
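
Steps (a) to (c) amount to a triage of samples according to their reactivity pattern. The Python sketch below illustrates one possible reading of that triage and is not the procedure of the invention: it assumes that probes No. 20 and No. 27 serve as universal HCV probes, treats all other probe numbers from 5 to 54 as typing probes, and only recognizes the simplest aberrant pattern (universal signal without any typing signal). Patterns that react with typing probes but cannot be interpreted would need additional rules not shown here.

```python
UNIVERSAL_PROBES = {20, 27}
# Assumption for illustration: every other probe numbered 5 to 54 is a typing probe.
TYPING_PROBES = set(range(5, 55)) - UNIVERSAL_PROBES


def triage(reactive_probes):
    """Decide the follow-up for one sample from its set of reactive probe numbers."""
    reactive = set(reactive_probes)
    if reactive & TYPING_PROBES:
        return "interpret pattern against known types and subtypes"
    if reactive & UNIVERSAL_PROBES:
        return "aberrant reactivity: clone and sequence the PCR fragment"
    return "no HCV signal"


print(triage({20, 27, 5, 31}))
print(triage({20, 27}))
print(triage(set()))
```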

The invention also relates to a method for determining the type(s) as well 
as the subtype(s) of HCV, and/or HIV, and/or HBV and/or HTLV present in a
biological sample, which comprises the steps of: 
- providing: 

* at least one of the probes as defined above, preferably 
the probes as defined above, enabling the genotyping (typing and/or subtyping) of 
HCV, and at least one of the following probes: 

* probes capable of detecting oligonucleotides of HIV 
types 1 and/or 2 which can be present in said biological sample, and/or

* probes capable of detecting oligonucleotides of HBV
subtypes, and/or sAg mutants, and/or cAg mutants which can be present in said
biological sample, and/or 

* probes capable of detecting oligonucleotides of HTLV-I 
and/or HTLV-II suspected to be in the biological sample, 

- possibly providing a set of primers as defined above, as well as at
least one of the following primers: sets of primers to respectively amplify human
immunodeficiency virus (HIV), and/or HBV and/or human T-cell lymphotropic
virus (HTLV) oligonucleotides, by means of PCR, and amplifying the
oligonucleotides of HCV, and either HBV and/or HIV and/or HTLV possibly
present in the biological sample,

- contacting 

* the biological sample in which the genetic material is 
made available for hybridization, 

* or the purified genetic material contained in said
biological sample,

* or single copies derived from the purified genetic
material,

* or amplified copies derived from the purified genetic
material,

with the probes defined above under conditions which
allow hybridization between the probes and the target sequences of isolates of 
HCV and at least one of the following viruses: HBV, and/or HIV, and/or HTLV, 

- detecting the complexes possibly formed between the probes 
used and the target regions possibly present in the biological sample. 

According to this embodiment, in addition to the type or subtype of HCV 
present in a biological sample, the invention also relates to a method for 
determining the type or subtype of any other parenterally transmitted viral isolate 
such as HTLV, HIV, HBV characterized by incorporating on one and the same 
strip, probes hybridizing specifically to: 

- the different types and/or subtypes of HCV as defined above, 

- human immunodeficiency viruses HIV-1 and HIV-2, 

- human T-cell lymphotropic viruses HTLV-1 and HTLV-2,

- the different HB surface antigen (HBsAg) mutants or HB core antigen 
(HBcAg) or HB precore Ag mutants. 

In some test samples, different target sequences of which the specific 
detection is of clinical relevance are present simultaneously. For each of these 
target sequences a separate hybridization test with the corresponding probe should 
be performed. The combination of different type/subtype specific probes 
comprised in the art, in combination with the new and inventive HCV
type/subtype-specific probes as explained in the present invention on one 
membrane strip could provide an easy and reliable general typing system for 
parenterally transmitted human viral diseases. If analyte strand amplification is 
necessary, a set of primers can be provided per viral organism to be differentiated 
and classified. 

The invention also relates to a solid support, particularly a membrane 
strip containing, on known locations of its surface, a selection of the following 
probes, or their complements, or the above-mentioned probes wherein T has been 
replaced by U: 

- NO 5, NO 6, NO 7, NO 8, NO 9, NO 10, NO 11, NO 12, NO 13, 
NO 14, NO 15, NO 16, NO 17, NO 18, NO 19, NO 20, NO 21, NO 22, NO 23, 
NO 24, NO 25, NO 26, NO 27, NO 28 to 54 and NO 93 to 96, as defined above, 
as well as a control to determine if there is hybridization between these probes and 
the ribo or deoxyribonucleotide strands of HCV, liable to be contained in a 
biological sample in which HCV isolates are to be differentiated. 

According to a specially preferred embodiment of the invention, the 
probes are immobilized in a line-wise fashion on a membrane strip. 

In this preferred embodiment of the invention, a set of probes each 
applied to a known location onto the membrane strip include probes selected from 
the sequences with SEQ ID NO 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 26, 27, SEQ ID NO 28 to 54, and NO 93 to 96 as well as a 
control line for conjugate binding. 

The method of this preferred embodiment of the invention makes it 
possible to quickly determine the type of HCV infection. This assay provides the 
ability to discriminate between at least 6 different HCV types and might 
discriminate between at least 18 subtypes, and is a good instrument for searching 
for new types or (sub)types of HCV. For example, new subtypes 1c, 1d and type 7, 
and other new (sub)types may contain specific mutations in the regions mentioned 
above, which can be employed for specific detection by means of type-specific 
probes derived from such new sequences. 

The invention also relates to a kit for the in vitro discrimination of at least 
one HCV isolate from a biological sample liable to contain it, and for its 
classification according to the HCV type and subtype, with said kit containing 

- at least one probe selected among any of those defined above; 

- a buffer or components necessary for producing the 
buffer enabling hybridization reaction between these probes and the cDNAs and/or 
RNAs of HCV isolates to be carried out; 

- when appropriate, means for detecting the hybrids 
resulting from the preceding hybridization. 

The invention also relates to a kit for typing at least one HCV isolate 
from a biological sample liable to contain it and for classifying it according to the 
HCV type and subtype, with said kit containing 

- possibly one universal probe as defined above, 

- at least one probe selected among any of those of the 

invention, 

- a buffer or components necessary for producing the 
buffer enabling hybridization reaction between these probes and the DNAs and/or 
RNAs of HCV isolates to be carried out; 

- when appropriate, means for detecting the hybrids 
resulting from the preceding hybridization. 

According to this embodiment, the invention also relates to a kit for 
genotyping (typing and/or subtyping) of HCV isolates comprising: 

- a set of probes as defined above, preferentially 
immobilized on a solid substrate, and more preferentially on one and the same 
membrane, and 

- possibly a set of primers as defined above, 

- a set of buffers necessary to carry out the hybridization as 
well as the detection of the hybrids formed. 

The invention also relates to a kit for typing HCV isolates belonging to at 
least one of the following HCV types: HCV type 1, HCV type 2, HCV type 3, 
HCV type 4, HCV type 5, HCV type 6 with said kit containing at least one of the 
probes as above defined, 

- the buffer or components necessary for producing the 
buffer enabling hybridization reaction between these probes and the cDNAs and/or 
RNAs of the above-mentioned HCV isolates to be carried out; 

- when appropriate, means for detecting the hybrids 
resulting from the preceding hybridization. 

The invention advantageously relates to a kit for the discrimination and 
classification of HCV types and subtypes, with said kit containing: 

- at least one of the probes as defined above, 

- the buffer or components necessary for producing the 
buffer enabling hybridization reaction between these probes and the DNAs and/or 
RNAs of the above-mentioned HCV isolates to be carried out; 

- when appropriate, means for detecting the hybrids 
resulting from the preceding hybridization. 

It is to be mentioned that all the probes from SEQ ID NO 1 to SEQ ID 
NO 54 and SEQ ID NO 93 to 96 are new. 

Furthermore, probes of SEQ ID NO 18, 29, 33, 34, 35, 40, 42, 43, 47, 49 
and 54 are derived from new sequences. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 

Ethidium bromide-stained agarose gel showing the length of the nested 
PCR fragments. Lane A of each pair shows the PCR fragment with incorporation 
of Bio-11-dUTP. Lane B is the PCR fragment without Bio-11-dUTP. 1: Serum 
BR28, 2: Serum BR24, 3: Serum BR29, 4: Serum BR33, 5: Serum BR36, 6 and 7: 
negative control sera, 8: Serum JP62, 9: Serum BR23, 10: cPCR control without 
template, M: molecular weight markers. 

Figure 2 

Alignment of the 5' UR nucleotide sequences of isolates from four 
different types of HCV. Boxed nucleotides indicate the positions of probes used 
for typing of the four different groups. The underlined nucleotides are used for 
subtyping within each group. The period between nucleotide -140 and -139 in 
most sequences corresponds to the insertion in some of the type 4 isolates. The 
numbering of the probes corresponds with the numbers used in Table 4. 

Figure 3 

HCV LiPA typing results of some representative sera. The strip contains 
19 parallel probe lines: 

A: Probe 5 (SEQ ID NO 5); B: Probe 6 (SEQ ID NO 6); C: Probe 7 (SEQ 
ID NO 7); D: Probe 8 (SEQ ID NO 8); E: Probe 26 (SEQ ID NO 26); F: Probe 22 
(SEQ ID NO 22) and Probe 24 (SEQ ID NO 24); G: Probe 10 (SEQ ID NO 10); 
H: Probe 13 (SEQ ID NO 13); I: Probe 14 (SEQ ID NO 14); J: Probe 21 (SEQ ID 
NO 21); K: Probe 15 (SEQ ID NO 15); L: Probe 16 (SEQ ID NO 16); M: Probe 
17 (SEQ ID NO 17); N: Probe 19 (SEQ ID NO 19); O: Probe 18 (SEQ ID NO 18); 
P: Probe 155 (antisense probe: 5'-GGGGGCCTGGAGGGTG-3') (SEQ ID NO 
97); Q: Probe 27 (SEQ ID NO 27); R: Probe 20 (SEQ ID NO 20); S: control line 
for conjugate binding. 

The strips were hybridized with cPCR products of the following sera: 
Strip 1: serum BR5, Strip 2: serum BR12, Strip 3: serum BR18, Strip 4: serum 
BR22, Strip 5: serum BR19, Strip 6: serum BE95, Strip 7: serum BU79, Strip 8: 
serum BR23, Strip 9: serum JP63. 
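
For readers who wish to record strip readings electronically, the line layout given in the Figure 3 legend can be written as a lookup table. The following Python sketch is illustrative only and forms no part of the assay; the names `STRIP_LAYOUT` and `probes_on_lines` are hypothetical, and the sketch assumes that the lines run alphabetically from A to S, so that Probe 7 lies on line C.

```python
# Line-to-probe layout of the 19-line strip, transcribed from the Figure 3 legend.
# Values are SEQ ID NOs; line S is the conjugate control and carries no typing probe.
# Assumption: lines run alphabetically A..S, so Probe 7 is on line C.
STRIP_LAYOUT = {
    "A": (5,), "B": (6,), "C": (7,), "D": (8,), "E": (26,),
    "F": (22, 24), "G": (10,), "H": (13,), "I": (14,), "J": (21,),
    "K": (15,), "L": (16,), "M": (17,), "N": (19,), "O": (18,),
    "P": (97,), "Q": (27,), "R": (20,), "S": (),
}


def probes_on_lines(reactive_lines):
    """Return the sorted SEQ ID NOs of all probes located on the given lines."""
    seq_ids = set()
    for line in reactive_lines:
        seq_ids.update(STRIP_LAYOUT[line])
    return sorted(seq_ids)


# Hypothetical reading in which lines F, R and S are coloured.
print(probes_on_lines(["F", "R", "S"]))
```

A reading expressed this way still has to be interpreted against Table 2; the lookup only translates line letters into probe numbers.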

Figure 4 

Nucleotide sequence alignment of the HCV 5'untranslated regions of new 
isolates BE90, BE91, BE92, BE93, BE94, BE95, BE96, BE97, BE98, BE99, 
GB48, GB116, GB358, GB569, GB549, GB809, CAM600, CAM736, GB478, 
GB724, and GB438, with sequences of HCV type 1a (HCV-1), 1b (HCV-J), 2a 
(HC-J6), 2b (HC-J8), 3a (BR56), 3b (HCV-TR), 5 (SA1), 6 (HK1). The sequences 
used to construct this alignment are taken from the EMBL database and have the 
following accession numbers, in the order listed: M62321, D10749, D00944, 
D01221, D13448, D11443, M84838, and L08156. The sequences between nucleotides 
-220 and -180 are not shown; they are identical to HCV-1 in all isolates. 
nucleotide is identical to the corresponding nucleotide in HCV-1; \,\ gap created 
between -145 and -144 to allow alignment with type 6 sequences which have a CA 
insertion; gap created between -138 and -137 in most of the sequences to 
preserve alignment with sequences which have an extra nucleotide at that position. 
* refers to the conserved HCV sequence between residues -220 and -180 as shown 
in Figure 2. 

Figure 5 

Amino acid sequence alignment of the NS5 sequences of isolates BE90, 
BE91, BE92, BE93, BE95, GB358, GB549, and GB809 with known sequences as 
described in Table 6. 

Figure 6 

Line probe assays including probe with SEQ ID NO 32, tested with type 
1 and 2 sera. 1, type 1b serum BE82, 2, type 2a serum JP62, 3, type 2b serum 
BE91, A, conjugate control, B, probes 20 and 27, C, probe 8, D, probe 26, E, 
probe 32 (SEQ ID NO 32). 

Figure 7 

Line probe assays including probes with SEQ ID NO 33 and 34, tested 
with type 2a, 2b, and 2c sera, 1, type 2a serum JP62, 2, type 2b serum BE91, 3, 
type 2c serum BE92, A, conjugate control, B, probes 20 and 27, C, probe 8, D, 
probe 26, E, probe 32, F, probe 22, G, probe 24, H, probe 23, I, probe 25, J, probe 
33 (SEQ ID NO 33), K, probe 34 (SEQ ID NO 34). 

Figure 8 

Line probe assays including probes with SEQ ID NO 31, 37 and 38, 
tested with type 4 sera, 1, type 4a serum GB116, 2, serum GB113, 3, type 4f serum 
GB438, A, conjugate control, B, probes 20 and 27, C, probe 37 (SEQ ID NO 37), 
D, probe 38 (SEQ ID NO 38), E, probe 19, F, probe 31 (SEQ ID NO 31), G, probe 
7. 

Figure 9 

Line probe assays including probes with SEQ ID NO 44, 45 and 46, 
tested with type 4a and 5a sera. 1, type 5a serum BE95, 2, type 4a serum GB116, 
A, conjugate control, B, probes 20 and 27, C, probe 44 (SEQ ID NO 44), D, probe 
45 (SEQ ID NO 45), E, probe 46 (SEQ ID NO 46), F, probe 31 (SEQ ID NO 31), 
G, probe 7. 

Figure 10 

Line probe assays including probes with SEQ ID NO 93, 94, 95, and 96, 
tested with type 4a and 5a sera, 1, type 4a serum GB116, 2, type 5a serum BE95, 
A, conjugate control, B, probes 20 and 27, C, probe 93 (SEQ ID NO 93) applied at 
a concentration of 0.4 pmol/µl, D, probe 94 (SEQ ID NO 94) applied at a 
concentration of 2.5 pmol/µl, E, probe 94 (SEQ ID NO 94) applied at a 
concentration of 1.0 pmol/µl, F, probe 94 (SEQ ID NO 94) applied at a 
concentration of 0.4 pmol/µl, G, probe 95 (SEQ ID NO 95) applied at a 
concentration of 2.5 pmol/µl, H, probe 95 (SEQ ID NO 95) applied at a 
concentration of 1.0 pmol/µl, I, probe 95 (SEQ ID NO 95) applied at a 
concentration of 0.4 pmol/µl, J, probe 96 (SEQ ID NO 96) applied at a 
concentration of 0.4 pmol/µl. 

Table 1 

Overview of the different classification systems. 

Table 2 

Interpretation of the results shown in Figure 3. 

Table 3 

Final results of HCV LiPA typing and HCV antibody assays. 

A summary of the typing in relation to the serology is presented. The 
INNO-LIA HCV Ab assay contains one line with NS4 epitopes, one line with NS5 
epitopes, and 4 lines with core epitopes. Only the highest score for the core lines is 
given. The intensity of the signal is given by a number: 0 = negative; 9 = 
indeterminate; 1 to 3 = positive. The final interpretation of the antibody test is 
given in the LIA column: 1 = positive; 0 = negative; 9 = indeterminate. 
The signal-to-noise ratio of the sera tested in the Innotest HCV Ab is also 
given for some of the sera. 

Table 4 

Nucleotide sequence, position, and orientation of the primers and probes. 

Table 5 

Overview of new probes designed from new types and subtypes of HCV. 
The type or subtype for which classification is possible or improved, the sequence, 
and the SEQ ID NO. are shown. 

* represents a probe which does not type or subtype all isolates found 
representing said type or subtype. The underlined letters indicate provisional 
divisions into subtypes. 

Table 6 

Sequence homology between BE90, BE91, BE92, BE93, BE95, GB358, 
GB549, and GB809 and published sequences in the HCV NS5 region from 
nucleotide 7935 to 8274, according to the numbering used in the present 
invention. Homology scores within the same subtype are in bold. Published 
sequences used to perform homology calculations were taken from the EMBL 
database and have the following accession numbers: 1 M62321, 2 D10749, 3 
M67463, 4 D90208, 5 X61596, 6 L02836, 7 M84754, 8 D10750, 9 D11168; 
D01171, 10 S38204, 11 M58335, 12 D10078, 13 D10079, 14 D10080, 15 D10081, 
16 D00944, and 17 D01221. All of them represent complete genomes, except ^\ ^\ 
^5 and 1^ for which NS5 sequences were published. was published in the Chiron 
patent WO 92/19743, SEQ ID NO 18. 

EXAMPLES 

In order to study the natural variation of HCV isolates obtained from 
different geographical areas throughout the world, a rapid means for typing and 
subtyping of HCV isolates in the form of a Line Probe Assay (LiPA) was 
developed. 

Essentially, a cPCR fragment containing incorporated biotinylated dUTP 
is hybridized to oligonucleotides which are immobilized on a nitrocellulose 
membrane. The stable hybridization duplex is then revealed by streptavidin- 
labelled alkaline phosphatase, and subsequent color development with NBT (nitro 
blue tetrazolium) and BCIP (bromochloro-indolyl phosphate). The cPCR 
fragment is synthesized from the 5' UR of any HCV RNA using highly conserved 
sets of primers. The oligonucleotides used for typing are directed against the 
internal type-specific variable parts of the cPCR fragment. In fact, the 2 variable 
regions between positions -170 and -155, and between -132 and -117 in the linear 
sequence may be part of a stem in the folded viral RNA, and mutations in the first 
region may be complemented by another mutation in the second region to allow or 
disallow RNA duplex formation. Variation and conservation are expected to occur 
at the same positions in other new types of HCV as well and, therefore, this 
variable region might remain instrumental for the discrimination between all 
current and yet-to-be discovered types of HCV. Moreover, since higher 
variabilities compared to the 5' UR are observed in the core, NS3, and NS5 
regions, typing in these regions employing universal sets of primers might no 
longer be tenable. 

The proposed nomenclature of this invention is provisional and could still 
be subject to amendments according to new guidelines that may be set forward by 
international committees. For example, subtype 4a might be changed into another 
type 4 subtype, like 4c or 4e, and type 4 might be changed into type 5 or 6, in 
which case type 4a might become 6c, for example. However, new classification 
systems will not hamper classification of a certain group of isolates classified into 
a type or subtype by means of the proposed probes of the invention. 

1. Serum samples used for typing and subtyping 

Sixty-one Brazilian samples (BR1 to BR61) were tested in the HCV 
Antibody ELISA assay (Innotest HCV Ab, Innogenetics) as well as in the Inno- 
LIA HCV Ab test (Innogenetics). The first 23 serum samples (BR1 to BR23, 
Table 3) were taken from hemodialysis patients with either high ALT levels or 
positive Inno-LIA results, or from blood donors whose recipients 
developed NANB hepatitis liver disease. Fourteen (BR24 to BR37) of the other 
serum samples were randomly chosen; the 24 remaining sera (BR38 to BR61) 
were selected on the basis of their LIA pattern. Most of the latter showed weakly 
positive, indeterminate, or negative reactivity with the NS4 and NS5 synthetic 
peptides on the LIA. The following sera were also included in this typing effort: 
two pools of Japanese sera (JP62 and JP63), six Belgian sera (BE64 to BE69), four 
sera from the Netherlands (NE70 to NE73), six sera from Burundi (BU74 to 
BU79) and two sera from Gabon (GB80 and GB81). They were all tested with the 
Inno-LIA HCV Ab assay system. The sera BU74 to BU78 were only positive for 
anti-core antibodies, while the serum BU79 reacted only with the NS5 line. Both 
Gabonese sera were LIA HCV negative (Inno-LIA HCV), HIV negative (Innotest 
HIV), but HTLV positive (Innotest HTLV). One serum from Belgium (BE69) and 
one from the Netherlands (NE73) were completely negative. Three of the NE sera 
(NE71 to NE73) were selected because they were negative in the second 
generation RIBA test (Ortho Diagnostics Inc.). 

2. cPCR, analysis of the PCR product, and cloning 

The primers used for the PCR reactions were complementary to the 
conserved areas of the 5' UR of the different HCV types. Degeneration was 
included to allow annealing to type 1 and type 2 sequences (Kato et al, 1990; 
Nakao et al, 1991; Okamoto et al, 1991) and to the sequence of our type 3 clone 
(BR56; accession number D13448, DDBJ/EMBL/GenBank DNA data base 
deposited on 21/10/1992). The sequences of the outer PCR primers (HcPr98, SEQ 
ID NO 1 and HcPr29, SEQ ID NO 2) and of the nested PCR primers (HcPr95, 
SEQ ID NO 3 and HcPr96, SEQ ID NO 4) are listed in Table 4. The probes used 
for the detection of the different serum types are also listed in Table 4. All 
oligonucleotides were synthesized on a 392 DNA/RNA Synthesizer (Applied 
Biosystems). 

Viral RNA was extracted from serum essentially as described by 
Chomczynski and Sacchi (1987) with minor modifications. The RNA was 
coprecipitated with 20 µg Dextran T500 (Pharmacia). The RNA pellet was briefly 
dried and resuspended in 10 µl DEPC-treated H2O. After adding 2 µl of 150 ng/µl 
random primers (Pharmacia) and denaturing for 10 minutes at 65°C, the first 
strand cDNA synthesis was carried out in 20 µl at 42°C in the presence of 25 U 
HPRI (Amersham), 500 µM dATP, dCTP, dTTP and dGTP, 1 x AMV buffer 
(Stratagene) and 2.5 U AMV-RT (Stratagene). Seven µl of the resulting cDNA 
was amplified in an outer PCR over 40 cycles each consisting of 1 min 95°C, 1 
min 55°C and 1 min 72°C in a total volume of 50 µl. The solution was adjusted to 
a final concentration of 200 µM of dATP, dCTP, dTTP and dGTP, 1 x Taq buffer 
(Stratagene), 0.2 µM of each primer, and 1 U Taq polymerase (Stratagene). One µl 
of the first round amplification product was amplified with the nested primers 
again for 40 cycles in a buffer with the same composition. For HCV typing, the 
nested PCR contained 40 µM Bio-11-dUTP (Sigma) and 160 µM of dTTP. Both 
the outer and the nested PCR product were then subjected to electrophoresis in a 
2% low melting point (NuSieve GTG, FMC)/1% Ultra Pure (Gibco BRL) agarose 
gel. After ethidium bromide staining, PCR fragments were cut out from the 
agarose gel, the DNA was recovered by centrifugation through a 0.45 µm HV 
membrane (Millipore), purified by two phenol/chloroform and two ether 
extractions, precipitated, and subsequently polished with T4 DNA polymerase 
(Boehringer), kinated with T4 kinase (Boehringer), and finally ligated in the 
dephosphorylated Eco RV site of pBluescript KS(-) (Stratagene). Plasmid DNA 
preparation was as described in the alkaline lysis method (Maniatis et al, 1982). 
Sequencing reactions were carried out on double-stranded plasmid DNA with T7 
and T3 primers by using the Deaza G/A T7 sequencing mixes (Pharmacia). 
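
As a worked illustration of the labelling mix used in the nested PCR for typing, the share of the combined dUTP/dTTP pool that carries biotin can be computed from the two concentrations stated above. The sketch below assumes that both figures are final concentrations expressed in µM; the function name `biotin_fraction` is hypothetical.

```python
def biotin_fraction(bio_dutp_um: float, dttp_um: float) -> float:
    """Return the fraction of the combined dUTP/dTTP pool that is biotinylated."""
    total = bio_dutp_um + dttp_um
    if total <= 0:
        raise ValueError("total concentration must be positive")
    return bio_dutp_um / total


# Values from the nested PCR for typing (assumed to be in µM).
pool_um = 40 + 160                    # combined Bio-11-dUTP + dTTP pool
fraction = biotin_fraction(40, 160)   # 40 / 200
print(f"pool = {pool_um} µM, biotinylated share = {fraction:.0%}")
```

Under this assumption the combined pool equals the 200 µM used for each of the other nucleotides, with one fifth of it supplied as Bio-11-dUTP.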

The results of these sequencing reactions are shown in Figure 2. The 
following sequences were deposited in DNA databases (BR56: 
DDBJ/EMBL/GenBank accession number D13448; BU74: DDBJ/EMBL/GenBank 
accession number D13449; BU79: accession number D13450; GB80: accession 
number D13451; GB81: accession number D13452; JP62: accession number 
D13453). 

Serum RNA from HCV-infected patients was used as template for cDNA 
synthesis, which in turn was a template for nested PCR. Two sets of PCR primers 
were designed: HcPr98 (SEQ ID NO 1) and HcPr29 (SEQ ID NO 2) for the outer 
reaction, HcPr95 (SEQ ID NO 3) and HcPr96 (SEQ ID NO 4) for the nested 
reaction (Table 4). These four primers were chosen to match the published 
sequences (Kato et al, 1990; Nakao et al, 1991; Okamoto et al, 1991) and the 
sequence of a clone obtained from the untranslated region of isolate BR56 (see 
Figure 2). The resulting amplification product of the nested PCR is 235 base pairs 
(bp) long. Due to the incorporation of Bio-11-dUTP, there is a decrease in 
mobility which is clearly visible after agarose gel electrophoresis (Fig. 1). The size 
of the DNA fragments is the same for all the different HCV types, suggesting that 
a second experiment, like restriction enzyme digestion or hybridization, is 
necessary for classification. A membrane strip containing immobilized HCV- 
specific oligonucleotide probes applied as parallel lines was therefore developed. 
These strips are hybridized with PCR amplified DNA fragments of the 5' UR into 
which biotinylated nucleotides were incorporated during synthesis. After 
hybridization, streptavidin labelled with alkaline phosphatase is added and 
becomes bound to the biotinylated hybrids formed during the hybridization. After 
incubation with NBT/BCIP, a purple precipitate appears. 

3. Preparation of the line probe assay (LiPA) strips 

The 16-mer oligonucleotides, specific for the different types or subtypes 
of HCV (Table 4, number 5 to 27), were provided with a poly-(dT) tail at their 
3' end as follows: 20 pmol of primer was incubated in 25 µl buffer containing 3.2 
mM dTTP, 25 mM Tris.HCl (pH 7.5), 0.1 M sodium cacodylate, 1 mM CoCl2, 0.1 
mM dithiothreitol, and 60 U Terminal deoxynucleotidyl Transferase (Pharmacia) 
for 1 hour at 37°C. The reaction was stopped by adding 2.5 µl 0.5 M EDTA (pH 
8.0) and diluted with 20 x SSC (Maniatis et al, 1982) until a final concentration 
of 6 x SSC and 2.5 pmol oligonucleotide/µl was reached. 

One pmol of this solution was applied over a distance of 4 mm on a 
nitrocellulose membrane. As control for the conjugate, biotinylated DNA was 
applied alongside. The oligonucleotides were fixed to the membrane by baking at 
80°C for two hours. The membrane was then sliced in 4-mm strips. 

4. LiPA test hybridization and color development 

Ten µl of the nested PCR amplification product, containing incorporated 
Bio-11-dUTP, is mixed with 10 µl of 400 mM NaOH/10 mM EDTA and 
incubated at room temperature (RT) for 10 minutes. Then, 1 ml prewarmed (37°C) 
hybridization buffer containing 3 M tetramethylammonium chloride (TMACl, 
Merck), 50 mM sodium phosphate (pH 6.8), 1 mM EDTA, 5 x Denhardt's 
(Maniatis et al, 1982), 0.6% (w/v) SDS and 100 ng/ml sheared salmon sperm 
DNA is added and the hybridization is carried out in a shaking water bath at 
for 2 hours (Jacobs et al, 1988). The strips are washed 2 times at RT for 5 minutes 
with 1 ml prewarmed (37°C) wash buffer (3 M TMACl, 0.2% SDS, 50 mM 
Tris.HCl, pH 8.0), followed by a stringent wash at 51°C for 30 minutes and two 
brief washing steps at RT. At this moment, the wash buffer is replaced by Rinse 
Solution (phosphate buffer containing NaCl, Triton, 0.5% NaN3; Inno-Lipa, 
Innogenetics, Antwerp, Belgium) and the strips are rinsed twice with 1 ml at RT. 
Finally, the strips are rinsed with Conjugate Diluent (phosphate buffer containing 
NaCl, Triton, protein stabilizers, 0.1% NaN3; Inno-Lia, Innogenetics, Antwerp, 
Belgium) and incubated with Conjugate Diluent containing 4000 x diluted 
streptavidin, labelled with alkaline phosphatase (Gibco BRL) for another 30 
minutes at RT. The strips are washed again 3 times with Rinse Solution and once 
with Substrate Diluent (Tris buffer containing NaCl and MgCl2; Inno-Lia, 
Innogenetics, Antwerp, Belgium). Color development is achieved by adding BCIP 
and NBT to the Substrate diluent and incubation of the strips for 30 minutes at RT. 
The color development is stopped by replacing the buffer with distilled water. 

5. A LiPA for discrimination between HCV types 1, 2 and 3 

The sequences for the probes against type 3 were derived from a cPCR 
clone from serum BR56 (accession number D13448). When comparing the 
published type 1 sequences with BR56, two regions of 16 nucleotides containing 4 
to 6 mutations could be observed each time. Surprisingly, when type 2 sequences 
became available, variation was again maintained in these two regions. Therefore, 
the position of the typing probes was chosen in those regions with a relatively low 
degree of homology between types, but good conservation within one type. In a 
first version of the strips, a total of eight separately immobilized oligonucleotides 
were applied. Two of them were directed against type 1 (HcPr124, SEQ ID NO 5 
and HcPr125, SEQ ID NO 6), four against type 2 (HcPr136, SEQ ID NO 9 and 
HcPr137 (SEQ ID NO 10) for type 2a, HcPr126 (SEQ ID NO 11) and HcPr127 
(SEQ ID NO 12) for type 2b) and two against type 3 (HcPr128, SEQ ID NO 13 
and HcPr129, SEQ ID NO 14) HCV (Table 4). 

cPCR products were synthesized from 23 Brazilian sera (BR1 to BR23) 
and, after hybridization, 17 of them recognized the 16-mers of type 1. Four type 3 
sera were found, as well as one type 2a serum. Serum BR23 was co-infected with 
type 1 and type 3. Two pools of Japanese sera were subsequently tested: JP63 
reacted with the type 1 and type 2a probes, and the majority of the JP62 pool 
contained type 2a sequences. After cPCR cloning and sequencing the region 
between the primers HcPr95 (SEQ ID NO 3) and HcPr96 (SEQ ID NO 4), the 
sequence of JP62 (Figure 2) was confirmed as type 2a. The type 2b probes 
HcPr126 (SEQ ID NO 11) and HcPr127 (SEQ ID NO 12), to which JP62 did not 
react, differed by only one and two nucleotides, respectively, from the sequence of 
JP62 (accession number D13453). It was therefore concluded that the chosen 
hybridization and washing conditions were very stringent and that even single 
mismatches abolish hybridization in this assay. 
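
The stringency observation above can be modelled, in a deliberately simplified way, as a rule that a probe line is expected to react only when the 16-mer matches the target window exactly. The Python sketch below illustrates that rule; the function names are hypothetical, both sequences are written in the same orientation for ease of comparison, and the two 16-mers are invented placeholders, not probes of Table 4.

```python
def count_mismatches(probe: str, target: str) -> int:
    """Count positions at which two equal-length sequences differ."""
    if len(probe) != len(target):
        raise ValueError("probe and target window must have the same length")
    return sum(1 for p, t in zip(probe.upper(), target.upper()) if p != t)


def is_expected_to_react(probe: str, target: str, max_mismatches: int = 0) -> bool:
    """Simplified stringency rule: react only if mismatches <= max_mismatches."""
    return count_mismatches(probe, target) <= max_mismatches


# Invented 16-mers for illustration only.
probe = "ACGTTGCAACGTTGCA"
variant = "ACGTTGCAACGTTGCT"   # differs from the probe at the last position

print(count_mismatches(probe, variant))
print(is_expected_to_react(probe, probe), is_expected_to_react(probe, variant))
```

With `max_mismatches=0` the rule mirrors the behaviour reported for JP62 and the type 2b probes; real hybridization also depends on mismatch position and base composition, which this sketch ignores.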

6. Discrimination between subtypes 

After careful comparison of all available type 1 coding sequences, two 
subtypes (1a and 1b) can clearly be distinguished, with an average genome 
homology of 79%. In the 5' UR, only 2 mutations were observed between HCV-J 
and HCV-1 in the region of the nested PCR product, resulting in 98.8% homology. 
Although only 2 mutations were present between HCV-1 (1a) and HCV-J (1b), the 
A-to-G transition observed at position -99 occurs in all type 1b isolates studied so 
far. Therefore, hybridization to probe HcPr138 (SEQ ID NO 7), which spans the 
region of the G substitution, is indicative of a type 1b isolate. 

When comparing all available 5' UR sequences of type 3 (present 
invention; Bukh et al, 1992; Chan et al, 1992; Lee et al, 1992), the isolates could 
be divided into two groups according to the presence of a common G (type 3a; 
HcPr140, SEQ ID NO 15) or a more rare A (type 3b; HcPr139, SEQ ID NO 16) at 
position -139. Discrimination between type 2a and type 2b (or K2a and K2b) 
could be made in the variable regions as reported above. 
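
The two single-position rules described in this section (position -99 for type 1 and position -139 for type 3) can be summarized as a small decision function. The Python sketch below is an illustration under stated assumptions and not the assay itself: the function name `call_subtype` is hypothetical, the bases are supplied directly rather than read from a sequence, and treating an A at position -99 as subtype 1a is an assumption based on the HCV-1 sequence.

```python
def call_subtype(hcv_type, base_at_minus_99=None, base_at_minus_139=None):
    """Return a provisional subtype label from the type and one diagnostic base."""
    if hcv_type == 1 and base_at_minus_99:
        base = base_at_minus_99.upper()
        if base == "G":          # G at -99 reported in all type 1b isolates studied
            return "1b"
        if base == "A":          # assumption: A at -99 as in HCV-1 (1a)
            return "1a"
    if hcv_type == 3 and base_at_minus_139:
        base = base_at_minus_139.upper()
        if base == "G":          # common G at -139: type 3a
            return "3a"
        if base == "A":          # rarer A at -139: type 3b
            return "3b"
    return f"{hcv_type} (subtype undetermined)"


print(call_subtype(1, base_at_minus_99="G"))
print(call_subtype(3, base_at_minus_139="a"))
print(call_subtype(2))
```

Type 2 is left undetermined here because its subtypes are distinguished in the variable regions rather than at a single position.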

The combination of all these type- and subtype-specific probes for type 1 
and 3 (Table 4) allowed us to separate the 17 Brazilian sera which previously had 
been characterized as type 1 into 8 type 1a and 9 type 1b sera. Three of the four 
type 3 sera formed hybrids with the type 3a line. Different molecules in the cPCR 
fragment of the co-infected serum BR23 hybridized with the lines for type 1b and 
type 3a (Figure 3, strip 8). 

Another 38 Brazilian sera (BR24 to BR61) were tested in this new LiPA. 
The most dominant criterion for the selection of these sera was the absence of 
antibodies for NS4 and NS5 epitopes, since earlier reports showed that there was a 
low degree of cross-reactivity between type 2 and type 3 anti-NS4 antibodies with 
type 1 NS4 antigens (Chan et al, 1991). Of the 38 Brazilian sera, 12 could be 
typed as type 1a, 14 as type 1b, 9 as type 3a, 2 as type 3b and one as a coinfection 
of type 1b and 3a. It was concluded that all the tested Brazilian sera could be typed. It 
remains to be determined whether the discrimination between type 3 subtypes is 
relevant. As no sequence data from the 5' UR of the Ta and Tb isolates (Mori et 
al, 1992) has been published, our division into type 3a and 3b is still tentative. 
More data about the serology and the sequences of the open reading frame are 
needed to confirm type 3 and type 4 subtyping. 

7. Identification of type 4 isolates and incorporation of type 4-specific 
probes in the LiPA 

PCR fragments amplified from 6 Burundian sera (BU74 to BU79) failed 
to react with any of the 16-mers on the strips. Three PCR fragments from these 
Burundian samples (BU74: accession number D13449, which was identical to 
BU76, and BU79: accession number D13450; Figure 2) were cloned and 
sequenced. Sequences that were clearly different from most of the previously 
described types were obtained. The Burundian samples are related to each other, 
and to Z6 (Bukh et al., 1992) and show higher homologies to type 1 than to type 3 
or type 2. However, most of the differences with type 1 were again located in the 
variable regions. The most surprising finding was the presence of one extra 
nucleotide in BU74 and BU76 between the positions -139 and -140. These results 
argue in favor of the existence of new HCV type(s) or subtype(s), which will be 
provisionally called type 4. The sequences of the 5' UR of the virus that could be 
amplified from these African sera were strongly divergent from the previously 
described types. Therefore, these isolates have been tentatively designated as type 
4. Similar sequences communicated in the study of Bukh et al. (1992) also 
originated from Africa, although one was from Denmark. Figure 2 shows that in 
the region between nucleotides -291 and -55, as many as 8 nucleotide variations 
are possible within this group. It is likely that type 4 is further composed of several 
subtypes, or that these subtypes are divergent subtypes of type 1. 

After obtaining these data, the LiPA was improved in three ways. First, 
oligonucleotide HcPr142 (SEQ ID NO 20), carrying one degeneration, was chosen 
from a highly conserved region as universal HCV probe for the confirmation of 
the presence of the PCR product (Table 4). Secondly, three oligonucleotides were 
synthesized for identification of the type 4 sequences (HcPr144, SEQ ID NO 17 
with one degeneration, HcPr145, SEQ ID NO 18 and HcPr146, SEQ ID NO 19; 
Table 4). Thirdly, a universal type 2 probe was selected outside the variable 
regions (HcPr147, SEQ ID NO 8, Table 4), since a universal probe for the 
detection of type 2 could not be chosen from the regions between positions -170 
and -155 and between positions -132 and -117. 

With this version of the LiPA, the 6 PCR fragments from the Burundian 
sera (Table 3) were classified as type 4 as expected (Fig. 3, strip 6 and 7). Two 
Gabonese sera, 4 sera from the Netherlands and 6 Belgian sera were also included 
in the screening. From GB80 a type 4 HCV 5' UR could be amplified, which was 
cloned and sequenced (Fig. 2). The other Gabonese serum GB81 showed a 
coinfection of a variant of type 2 (cloned and sequenced, Fig. 2) and type 4. The 
latter gave the same typing pattern as BU79 (Fig. 3, strip 7). To establish whether 
reaction of GB81 with the type 2 and type 4 probes was caused by unexpected 
cross-reactivity between typing probes, or merely the result of a coinfection, the 
cPCR product was cloned and 17 individual colonies were subjected to PCR and 
HCV LiPA. Ten (59%) colonies contained type 4 inserts and seven were type 2 
(41%), clearly indicating the co-circulation of 2 types of HCV in the same serum. 
For the three NE-sera which were negative in the Ortho RIBA test and positive 
(NE71), indeterminate (NE72) or negative (NE73) in the Inno-LIA HCV Ab test, 
it could be shown that type 3a isolates were present. The fourth NE serum, which 
showed good reactivities in both Ortho RIBA and INNO-LIA, contained a type 1a 
isolate. Finally, from the 6 Belgian sera analyzed, BE64 to BE67 were infected 
with type 1b strains. One patient of Italian origin (BE68) had a type 2a infection, 
and BE69 contained type 3a sequences. The latter was obtained from a case of 
chronic, viral-like NANB hepatitis, but was negative in all second generation 
assays and anti-NS3, anti-El, and anti-E2 research assays. This serum had a very 
low virus titer and became weakly positive only after the second round of PCR in 
four different samples taken during 2 years, showing the need for nested cPCR in 
HCV diagnosis. The sequence of the nested PCR fragment was identical with 
BR56. This was not surprising, since type 3 strains show very little sequence 
variation. 

In total, 19 different oligonucleotides were used for this version of the 
LiPA strips as shown in Figure 3. Because some of the oligonucleotides are 
directed against the same HCV subtype, probe HcPr156 (SEQ ID NO 22) was 
pooled with HcPr158 (SEQ ID NO 24) for type 2a. The oligonucleotides against 
type 4 were applied separately because too little sequence information from the 
coding region is known at this moment and hence, no division into subtypes (if 
any) can be made as yet. The presence of an extra base in some of the type 4 
sequences can form the basis for further attempts to subtype this group. The results 
obtained with some representative sera are shown in Figure 3. The interpretation 
of these strips is given in Table 2. 





In this study, 61 PCR-positive Brazilian HCV sera were typed. Twenty 
(33%) sera had a type 1a HCV infection, 23 (38%) were type 1b, one (1.5%) type 
2a, 15 (24.5%) type 3, and two (3%) sera with coinfections were found. The 
recognition of coinfected sera is illustrated by BR23 (Fig. 1, lane 9; Fig. 3, strip 8). 
The remaining 20 sera were collected from 5 different countries; 8 of the sera 
originated from two African countries. 

In a minority of cases, as for BE67, a type 1b PCR 
fragment recognized the 3b subtype probe HcPr139 (SEQ ID NO 16). This can be 
explained by assuming that the 1b sequence of serum BE67 has an A instead of a 
T at position -139. The results obtained with the JP62 (accession number 
D13453) sequence, where one mismatch in the oligonucleotide abolishes the 
hybridization signal, further support this assumption. Since isolate-specific 
mutations are scattered throughout the 5' UR, it is possible that an isolate of a 
given type also hybridizes to a subtyping probe of another type (see Fig. 3, strips 6 
and 7). Such reactivities merely indicate the presence of the sequence of the 
subtyping probe in the isolate studied. However, reactivities with multiple typing 
probes were never observed, unless a serum was coinfected, as investigated for 
GB81. 

In general, when a type 1a cPCR product hybridized on the LiPA, the 
sequence of the probes HcPr124 (SEQ ID NO 5), HcPr125 (SEQ ID NO 6) and 
HcPr142 (SEQ ID NO 20) must be present in the nested cPCR fragment. 
Consequently, 48 (26%) bp of 184 bp (Figure 2) are immediately known. 
Following the same reasoning, it can be calculated that for isolates similar to the 
HCV J type 33%, to the HC-J6 type 35%, to the BR56 type 34%, to the Z6 and 
BU77 type 26%, to the BU74 type 41% and to the BU79 type 32% of the sequence 
is known. However, it must be taken into account that due to the degeneration of 
some of the 16-mers, some information is lost and, hence, these percentages are 
maximum scores. Nevertheless, this approach supports the idea of the sequencing 
by the hybridization principle (Strezoska et al., 1991). 
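
The percentage reasoning above can be sketched in a few lines of Python. This is only an illustration: the function name and the probe offsets are hypothetical (the real probe positions are those of Figure 2), and the only figures taken from the text are the 16-nucleotide probe length and the 184 bp fragment length. The sketch counts each fragment position once, so overlapping probes do not inflate the score.

```python
def known_fraction(probe_spans, fragment_length):
    """Fraction of a fragment covered by at least one reactive probe.

    probe_spans: iterable of (start, length) pairs, 0-based offsets
                 (hypothetical values, for illustration only).
    """
    covered = set()
    for start, length in probe_spans:
        covered.update(range(start, start + length))
    # Ignore any position that falls outside the fragment.
    covered = {pos for pos in covered if 0 <= pos < fragment_length}
    return len(covered) / fragment_length


# Three non-overlapping 16-mers on a 184 bp fragment (offsets are made up).
spans = [(10, 16), (60, 16), (120, 16)]
print(f"{known_fraction(spans, 184):.0%}")
```

With three non-overlapping 16-mers the covered length is 48 bp, which reproduces the 48 of 184 bp quoted for type 1a.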

When comparing LiPA with antibody reactivity of these sera in our Inno-LIA 
HCV Ab assay (Table 3), some correlations between genotypes and their 
phenotypes (serotype) emerge. The type 3 and 4 sera from Belgium, the 
Netherlands, Gabon, and Burundi all react very weakly positive, indeterminate, or 
negative in the second generation antibody assays. The weakly positive reaction is 
mostly caused by anti-core antibodies, whereas antibodies against the LIA NS4 
and NS5 epitopes are usually absent. This is in agreement with the high 
conservation of core sequences encoding only slightly different epitopes which 
allow immunological cross-reaction. Epitopes for the NS4 and NS5 region are 
located in highly variable regions, disabling most of the immunological 
cross-reaction. As the current antibody assays contain type 1 epitopes, it is possible that 
a few percent of type 2, type 3, and type 4 infected sera will show a negative 
result. However, the conclusion of lack of cross-reaction of the type 3 Brazilian 
sera with type 1 NS4 and NS5 antigens cannot be drawn from our results (Table 
3). For the 14 randomly chosen sera (BR24 to BR37; Table 3), there was 100% 
correlation between the LIA reactivity and the 9 type 1 viruses. Of the four type 3 
sera, two (BR34 and BR36) reacted with NS4 and three (BR33, BR34 and BR35) 
with NS5. BR37 was not taken into account because of the coinfection. When all 
serological data of the 77 sera infected by a single type were analyzed, 58% and 
44% of the type 1 sera recognized the NS4 and NS5 epitopes, respectively. These 
percentages are rather low, owing to the selection criteria. For the type 3 sera, 
37% and 53% were reactive with the NS4 and NS5 epitopes, respectively. It is 
possible that higher cross-reactivities are observed in high-risk groups, such as in 
those samples obtained from Brazil, as compared with results in European blood 
donors (present invention and Chan et al., 1992). Such cross-reacting sera could 
be induced by multiple infections, some of which occur simultaneously, while others 
might occur one after another. A previous anti-HCV memory could be boosted by 
new HCV infections and result in co-circulation of viruses of one type with 
antibodies mainly directed against another type. Such an explanation is plausible 
for serum BR56, which has been typed as HCV type 3, but contained antibodies to 
type 1 core, E1, E2, NS3, NS4, and NS5 (data not shown). It remains to be 
determined whether anti-type 3 antibodies are present in this serum. 

Besides the differences in immune response, different HCV types could 
also show different progression to long-term liver disease, as has already been 
reported (Okamoto et al., 1992a). 

In conclusion, the LiPA allows a rapid determination of the type of HCV 
infection. This assay has the ability to discriminate between 4 different HCV types 
and 8 subtypes, and is a good means for determining new types. 

Moreover, this assay can be further improved by, for example, replacing 
the cPCR reactions by the RNA-capture PCR. Finally, this assay could prove to be 
instrumental in further establishing the relation between genotypes, future 
serotypes, and the clinical status or outcome of the disease. 

8. Identification of new types and subtypes and probes useful for their 
classification. 

Isolates BE82, BE90, BE91, BE92, BE93, BE94, BE95, BE96, BE97, 
BE98, obtained from Belgium; GB48, GB116, GB358, GB569, GB549, GB809, 
GB487, GB724, and GB438, obtained from Gabon; and CAM600 and CAM736, 
obtained from Cameroon, were retained for further study because aberrant 
reactivities were observed after genotyping by means of a LiPA including probes 5 
to 27 according to examples 3 and 4. The sequences of the 5' untranslated region 
were obtained after nested PCR by means of primers with SEQ ID NO 1, 2, 3 and 
4, cloning, and sequencing as described in example 2. Sequence information was 
obtained in the NS5 coding region for most of these isolates, and an alignment 
with known sequences is presented in Figure 5. The homologies of NS5 nucleic 
acid and amino acid sequences of representative isolates for each subtype with the 
sequences of published isolates are presented in Table 6. This calculation allows 
classification into types and subtypes, as presented in Figure 4. Nucleotide 
sequence alignment of the 5' untranslated regions of these new isolates with some 
prototype sequences is also presented in Figure 4. Several mutations can be 
observed compared to the HCV-1 sequence. As identical mutations in the 
5' untranslated region correlate with similar sequences in the coding region, such 
mutations are employed in the present invention to design new type- and 
subtype-specific probes. 
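
As an illustration of what a pairwise homology figure of the kind listed in Table 6 means, the following is a minimal sketch of a percent-identity calculation for two aligned sequences of equal length. The function name is hypothetical and the sketch is not the program used for Table 6; the two 16-mers in the usage lines are the motifs quoted in this example for probes 38 and 40, used here only as convenient test strings.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of identical positions between two aligned sequences.

    Both sequences must have the same, non-zero length; gaps and
    ambiguity codes are not treated specially in this sketch.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have identical lengths")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# Two 16-mers that differ at a single position (T versus C).
print(percent_identity("AATCGCCGGGATGACC", "AATCGCCGGGACGACC"))
```

One mismatch in 16 positions gives 93.75%; real comparisons would of course be made over the full NS5 fragment.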

BE82, a subtype 1b isolate, showed a C mutation at position -94, and 
therefore could not react with probe 7. After sequencing of the NS5 region, it 
could be concluded that this isolate belonged to subtype 1b. Therefore, probe 30, 
including a degeneration of T and C at position -94, should enable better 
genotyping of subtype 1b. 
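
How a degenerate position widens the range of isolates a probe can recognize may be sketched as follows. The sketch assumes a simple perfect-match model (every position must agree, and a degenerate code such as Y accepts either of its bases), which is a simplification of real hybridization. The function names are hypothetical. Because the sequence of probe 30 is not quoted in the text, the usage lines rely on the degenerate probe with SEQ ID NO 17 (AATYGCCGGGATGACC) and on motifs listed in Table 5.

```python
# IUPAC nucleotide codes and the bases each code stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC",
    "S": "CG", "W": "AT", "N": "ACGT",
}


def probe_matches(probe, target):
    """True if every target base is allowed by the probe at that position."""
    if len(probe) != len(target):
        return False
    return all(base in IUPAC[code] for code, base in zip(probe, target))


def find_probe(probe, sequence):
    """Return all 0-based offsets at which the probe matches the sequence."""
    n = len(probe)
    return [i for i in range(len(sequence) - n + 1)
            if probe_matches(probe, sequence[i:i + n])]


probe_17 = "AATYGCCGGGATGACC"                       # Y = C or T
print(probe_matches(probe_17, "AATCGCCGGGATGACC"))  # motif of probe 38
print(probe_matches(probe_17, "AATTGCCGGGATGACC"))  # motif of probe 45
print(probe_matches(probe_17, "AATCGCCGAGATGACC"))  # motif of probe 42
```

The first two motifs differ only at the degenerate position and both match; the third carries an additional A at a non-degenerate position and does not.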

BE90, another subtype 1b isolate, showed a T mutation at position -159 
and a G mutation at position -126, and therefore only reacted with the universal 
probes 20 and 27 and the subtyping probe 7. Sequencing of the NS5 region taught 
that the isolate belonged to subtype 1b. Probe 28, including a degeneration of T 
and C at position -126, should enable better genotyping of types 1 and 6. 

Isolate BE92 reacted only with probes 8 and 26 in addition to the 
universal probes 20 and 27. Thus, this isolate could be classified as type 2, but 
could not be subtyped because no reactivity with probes 23, 24, 25, or 26 could be 
observed. After sequencing, two new motifs could indeed be observed: 
GGACCCAGTCTTCCTG, covered by probe 33, and TGCCTGGTCATTTGGG, 
covered by probe 34. Sequencing of the NS5 region indeed revealed homologies 
with type 2a and 2b isolates compatible with classification within the same type, 
but in another subtype which is the proposed subtype 2c. 

Isolates BE93 and BE94 did not show any reactivity with the subtyping 
probe 14. After sequencing the 5' untranslated region and the NS5 region, it was 
concluded that these isolates belonged to the 3a subtype. Therefore, a probe 
containing a C and A degeneration at position -118, like probe 35, should allow 
better genotyping of subtype 3a. 

Isolates GB48, GB116, GB358, and GB569 showed positive 
hybridization signals on probes 17 and 19 in LiPA, indicating similarity to the 
previously reported type 4 isolates, but isolates GB549 and GB809 only reacted 
with the universal probes. The sequences of parts of the 5' untranslated region and 
NS5 were obtained. From Figures 5 and 6 and Table 6, it can be concluded that the 
isolates represented by GB358 belong to the same subtype of type 4, which is the 
proposed subtype 4a. However, both GB549 and GB809 show lower homologies 
to the subtype 4a, 4b and 4d isolates, and also to each other, but GB809 seems to 
belong to the same subtype as Z4. These homologies are compatible with 
classification into the same type 4, but into a different subtype of type 4: subtype 
4e for GB549 and subtype 4c for GB809 are proposed. Sequences obtained from 
isolates GB116, GB358, and GB569 all showed the motifs 
AATCGCCGGGATGACC, detectable with probe 38, and 
TTTCCGGGCATTGAGC, detectable with probe 19. Thus, probes 38 and 19 are 
useful for detection and classification of subtype 4a. Probe 38 is specific for 
subtypes 4a, 4b, 4d, 4f and 3b, while probe 19 recognizes subtypes 3b, 4a and 4d, 
but also hybridizes to the new types 3c and 4f. Interestingly, the new subtype 3b 
sequence HCV-TR should cross-react with these probes. However, 3b can still be 
classified as type 3 because of the reactivity with the type 3-specific probe 21. 
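
One possible way to reason about such overlapping probe specificities is to intersect the sets of subtypes each reactive probe can recognize. The sketch below is only an illustration of that idea and not the interpretation scheme of the LiPA itself; the dictionary and function names are hypothetical. The specificities of probes 38 and 19 are taken from the paragraph above and that of probe 37 from Table 5.

```python
# Subtypes recognized by each probe (from the text above and Table 5).
PROBE_SPECIFICITY = {
    19: {"3b", "3c", "4a", "4d", "4f"},
    37: {"4a", "4b", "4c", "4d", "4g", "4h"},
    38: {"3b", "4a", "4b", "4d", "4f"},
}


def candidate_subtypes(reactive_probes):
    """Subtypes compatible with every reactive probe (set intersection)."""
    sets = [PROBE_SPECIFICITY[p] for p in reactive_probes]
    if not sets:
        return set()
    return set.intersection(*sets)


print(sorted(candidate_subtypes([38, 19])))
print(sorted(candidate_subtypes([37, 38, 19])))
```

Under these assumptions, reactivity with probes 38 and 19 alone leaves 3b, 4a, 4d and 4f as candidates, which is why an additional type-specific probe is needed to separate type 3 from type 4.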

GB549 also shows characteristic motifs. Motif 
AATCGCCGGGACGACC can be detected by probe 40 and the sequence 
AATGCCCGGCAATTTG is detectable with probe 41. Thus, probes 40 and 41 
are useful for subtyping of subtype 4e. 

Reactivities identical to GB809 were obtained on LiPA with two samples 
obtained from Cameroon: CAM600 and CAM736. After sequencing the NS5 
region, it could be concluded that these samples belong to the same subtype as 
GB809, and after sequencing the 5' untranslated region, the same two motifs 
already present in GB809 were again detected. Thus, it appears that the motif 
AATCGCCGAGATGACC, detectable with probe 42, and 
AATGCTCGGAAATTTG, detectable with probe 43, are characteristic for 
subtype 4c, and that probes 42 and 43 are useful for detection and classification of 
subtype 4c. However, Z4, which shows homology in the E1 region compatible with 
classification into the same subtype 4c, shows 5' UR sequences which are again 
unique and may be detected by probes 7, 50 and 53. 

New sequences were detected in the 5' untranslated region of isolates 
GB487, GB724 and BE97. A new subtype classification, not based on sequence 
information of the coding region, is tentatively proposed for these isolates. All 
three isolates show the sequence TGCCTGGAAATTTGGG, detectable with 
probe 50. GB487 shows the unique sequence AATCGCCAGGATGACC, 
detectable with probe 49, and is tentatively classified as subtype 4h. GB724 and 
BE97 both contain the sequence GGAATCGCCAGGACGA, detectable with 
probe 53, and are tentatively classified as subtype 4g. 

Type 4 isolates usually show a T at position -238 and an A at -235. 
Therefore, probes 37, 38, and 51 should enable better genotyping of type 4. 

In another example, BE95, which only hybridized to probes 7 and 17 in 
the LiPA, shows low homologies in the coding region of about 68% with all other 
isolates, except BE96, which shows a homology to BE95 compatible with 
classification into the same subtype, which is the proposed subtype 5a. BE95, 
BE96, and SA1 all show the same motifs GAGTGTCGAACAGCCT, detected 
with probe 44; AATTGCCGGGAYGACC, detectable with probes 45 and 47; and 
TCTCCGGGCATTGAGC, detectable with probe 46. Thus, probes 44, 45, 46 and 
47 are useful for genotyping of type 5a. 

Sequences have been published by Bukh et al. (1992), which contain a 
unique CA insertion between positions -144 and -145. These isolates are 
tentatively classified as type 6 and can be detected by means of probe 48. 

A new type of hepatitis C virus was discovered in isolate BE98, which 
only reacted with probe 19 on LiPA. The sequence of the 5' untranslated region 
contains the new motif GAATCGCCGGGTTGAC that can be detected by means 
of probe 54. Sequencing of the core region revealed sequences showing about 
equally distant homologies to genotypes 3a and 3b, and a new type 3c is proposed 
for this prototype sequence. 

Isolate GB438 contains sequence motifs which are typical for subtype 4a, 
detectable with probes 38 and 19, but still shows a different sequence in the E1 
region, representing a new subtype within type 4, which was designated subtype 
4f. Discrimination from subtype 4a may be performed by means of probes with 
SEQ ID NO 51 and 7. 

Probes 29, derived from the sequence of BE90, and probes 51 and 52, 
derived from the sequence of GB724, may be useful to improve genotyping of 
certain HCV types or subtypes. 

Example 9: calculation of nucleotide distances 

Phylogenetic analysis. 

Previously published sequences were taken from the EMBL database, release 35. 
Other sequences were analysed by the inventors and have been deposited in the 
DDBJ database. Sequences were presented in a sequential format to the Phylogeny 
Inference Package Version 3.5c (Felsenstein, March 1993). Only sequences with 
identical lengths were included in the similarity calculations. The programs 
employed were DNADIST, PROTDIST, DNAPARS, PROTPARS, NEIGHBOR, 
SEQBOOT, CONSENSE and DRAWTREE. DNA maximum likelihood distance 
matrices were produced by DNADIST using the Kimura 2-parameter setting. A 
bootstrapping analysis was run using SEQBOOT, with 2000 repetitions. All these 
matrices were further analyzed in NEIGHBOR, using the Neighbor-Joining 
settings, and in CONSENSE to calculate the consensus tree. The SEQBOOT 
dataset was also analyzed in the DNAPARS program on 1130 repetitions. 
Deduced protein sequences were analyzed in PROTDIST followed by 
NEIGHBOR. Finally, the program DRAWTREE was used to create a graphic 
output of the phylogenetic tree. All analyses were done on a SUN SPARC IPX 
computer station. 
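
To make the Kimura 2-parameter model more concrete, the following sketch computes the textbook closed-form distance d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)), where P and Q are the proportions of transitions and transversions between two aligned sequences. This is an assumption-laden illustration and may differ in detail from what DNADIST computes (for example in how the transition/transversion ratio is handled); the function name is hypothetical.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a, seq_b):
    """Closed-form Kimura 2-parameter distance for two aligned sequences.

    Positions where either sequence holds something other than A, C, G
    or T are skipped. math.log raises ValueError when the sequences are
    too divergent for the formula to be defined.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have identical lengths")
    transitions = transversions = compared = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable positions")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Two 16-mers differing by a single transition (T versus C).
print(round(k2p_distance("AATCGCCGGGATGACC", "AATCGCCGGGACGACC"), 4))
```

For short strings such as these 16-mers the estimate is only a toy value; the distances reported in this example were computed on much longer fragments.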

The NS5 region. 

By using the primer set described by Enomoto et al. (1990), we amplified, cloned 
and sequenced 340 bp long NS5b PCR fragments from 13 different isolates. The 
nucleotide sequences were used to create a phylogenetic tree using the DNADIST 
program of the PHYLIP 3.5c package (Felsenstein, 1993). A diversity of 6 major 
groups or 'types' is evident for this unrooted tree. Each group could be further 
subdivided into two or more subgroups or 'subtypes'. The following clusters 
(groups consisting of closely related isolates) were created: 1a, 1b, 2b, 3a, 3b, 4a 
and 5a. This clustering appeared in 100% of the bootstrap resampled data sets 
using the programs SEQBOOT/DNADIST/NEIGHBOR/CONSENSE on 2000 
repetitions. The bootstrapped DNAPARS analysis yielded a similar clustering. 
From the DNADIST matrix, the molecular evolution distances between isolates, 
subtypes and types could be calculated. Only the above indicated separated 
clusters were included in these calculations. Between isolates in one subtype, this 
distance ranged from 0.0148 to 0.1064 (mean 0.0623; SD 0.0181). The distance 
between subtypes ranged from 0.1654 to 0.2675 (mean 0.2312; SD 0.0182) and 
that between types from 0.3581 to 0.6549 (mean 0.4942; SD 0.0485). However, 
some exceptional cases appeared. 

The distance between HC-J6 and isolate BE92 was 0.1539, a low value for 
distances between subtypes, but far above any value obtained between isolates 
belonging to the same subtype. NS5 nucleotide sequence homology between 
HC-J6 and BE92 was 86.2%. The bootstrapped DNA datasets clustered both 
sequences in 98.8% of the cases, which is an argument for a subtype 2a 
classification, but the molecular evolutionary distance and the sequence of the 5' 
UR of BE92 allowed us to tentatively classify this isolate as subtype 2c. 
GB809 could be positioned at a mean distance of 0.1509 (min./max. = 
0.1384/0.1597) from the type 4a cluster. A maximum homology of 87.4% exists 
between GB809 and GB358 (type 4a). However, these data, together with the 
observed variations in the 5' UR, allowed us to create a new type 4 subtype, 4c. 
GB549 represents the new subtype 4e and has distances of 0.2426 from cluster 4a; 
0.2403 from subtype 4c and 0.1738 from GB438 at the nucleotide level. Isolate 
GB438 possibly represents another new type 4 subtype, tentatively designated 4f. 
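
The way the NS5 distance ranges quoted above separate ordinary cases from exceptional ones can be sketched as a simple range lookup. The ranges and the example distances are those reported in this section; the function name, the labels and the idea of a fixed lookup are illustrative assumptions, since the classification in this example also relies on bootstrap support and on the 5' UR sequences.

```python
# Observed NS5 distance ranges reported above: (label, minimum, maximum).
NS5_RANGES = [
    ("same subtype", 0.0148, 0.1064),
    ("different subtypes of one type", 0.1654, 0.2675),
    ("different types", 0.3581, 0.6549),
]


def classify_distance(distance, ranges=NS5_RANGES):
    """Return the label of the observed range containing the distance."""
    for label, low, high in ranges:
        if low <= distance <= high:
            return label
    return "outside observed ranges"


print(classify_distance(0.2426))  # GB549 versus cluster 4a
print(classify_distance(0.1539))  # HC-J6 versus BE92
print(classify_distance(0.1509))  # GB809 versus cluster 4a (mean)
```

The values for BE92 and GB809 fall in the gap between the within-subtype and between-subtype ranges, which is why they are discussed above as exceptional cases.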
The core/E1 region. 

Calculation of the phylogenetic tree for the core/E1 region between nucleotides 
378 and 957 using the DNADIST program resulted in the recognition of six 
major branches, representing the 6 different genotypes. The following clusters 
could be delineated with 100% certainty from the bootstrap resampling analysis 
on 2000 repetitions: subtype 1a, 1b, 2a, 2b, 3a and 4a. The clustering is irrelevant 
for subtype 4c, 4e, 4f and 5a because they are only represented by one isolate. 
Based on the DNADIST matrix, the molecular evolutionary distance for isolates 
belonging to the same subtype ranged from 0.0402 to 0.111 (mean 0.0772, SD 
0.0197), between subtypes from 0.1864 to 0.3535 (mean 0.2833, SD 0.0350) and 
between types from 0.3824 to 0.6230 (mean 0.4894, SD 0.0554). 
The distances from the DNADIST matrix provided further evidence for the 
existence of at least 4 different subtypes in type 4. Type 4a has a mean mutual 
distance of 0.0083, while the mean with type 4c, 4e and 4f was 0.2602. Subtypes 4c 
and 4f were separated from 4e by, respectively, 0.2047 and 0.1864, while the 
distance between 4c and 4f was 0.2316. 
The NS3/4 region. 

DNA sequences containing the previously described type 3a epitope region 
(Stuyver et al., 1993a) and other sequences of the EMBL databank were used to 
calculate the nucleotide distances using DNADIST and NEIGHBOR. From the 
DNADIST matrix, the molecular evolutionary distances between isolates ranged 
from 0.0407 to 0.1181 (mean 0.0855, SD 0.0190), between subtypes from 0.2281 
to 0.2603 (mean 0.2416, SD 0.0098) and between types from 0.4052 to 0.6247 
(mean 0.4889, SD 0.0531). 

Example 10 

As described in the introduction to the examples and in previous examples, 
variable regions in the 5' UR are expected to contain genotype-specific sequences 
also in newly discovered genotypes, as exemplified in example 8, and 
consequently, such new genotype-specific motifs should again be detectable by 
means of the genotype-specific probes as described in example 8. Therefore, 
probe 32 and, as described in example 8, probes 31, 33, 34, 37, 38, 44, 45, and 46 
were synthesized and applied to nitrocellulose membranes, and line probe assays 
with biotin-labelled PCR fragments were performed as described in examples 3 and 
4, except for the labelling of the PCR product with biotin, which was not 
incorporated from bio-11-dUTP, but from 5'-biotinylated primers with SEQ ID 
NO 3 and 4 or 5'-biotinylated primers with SEQ ID NO 1 and 2, during the 
synthesis of the PCR fragment. 

Figure 6 shows the type-specific hybridization of HCV type 2, but not type 1, 5' UR 
fragments to the probe with SEQ ID NO 32. Both subtype 2a and 2b isolates 
hybridized specifically to probe 32. 

In Figure 7, the probes with SEQ ID NO 33 and 34 could be shown to hybridize 
specifically to the genotype 2c PCR product derived from serum BE92, while 
genotype 2a and 2b sera did not react to these probes although a specific 
hybridization with the respective 2a and 2b genotype-specific probes could be 
observed. It is to be understood that the new genotype 2c may differ from other 
genotype 2c subtypes discovered recently, and therefore, alternative names may 
be proposed for denomination of this subtype. 

Figure 8 shows line probe assays performed with type 4 sera to show specific 
hybridization of the most common type 4 sera to probes 37 and 38. 
Figure 9 depicts specific hybridization of type 5a sera to probes 44, 45, 46, and 
probe 7, while reactivity of type 4 sera is usually confined to probe 31 and absent 
on probes 44 to 46. Therefore, the promiscuity of the probe with SEQ ID NO 18 for 
both type 4 and type 5 isolates can now conveniently be overcome by employing, 
in addition to the probe with SEQ ID NO 19, probes with SEQ ID NO 37, 38, 44, 
45, 46, 7, 30, and 31 for discrimination of genotypes 4 and 5. 

Example 11 

It may be preferable to use other hybridization conditions (temperature, buffers) than 
those outlined in examples 3 and 4. Therefore, probes with SEQ ID NO 93, 94, 95, 
and 96 were applied to nitrocellulose membranes. Figure 10 shows line probe 
assays with the type 4a serum GB116 and the type 5a serum BE95, as described in 
example 4, except for the following: after denaturation of the PCR fragment in 
NaOH/SDS, 1.0 ml hybridization solution (prewarmed at 50°C) consisting of 3x 
SSC (Maniatis et al., 1982) and 1% sodium dodecyl sulphate (SDS) was added to 
the denatured PCR product and hybridization was performed in a shaking water 
bath at 50°C for 2 hours. The strips were washed with the same hybridization 
solution at 50°C for 30 min, after which the strips were washed and color 
development was performed as described in example 4. In Figure 10, a clear 
type-specific reaction can be observed. Therefore, type-specific hybridization has been 
obtained in hybridization conditions other than those described in examples 4 and 10, 
after having introduced minor modifications to the hybridization probes. For example, 
the position of the probes can be changed in order to achieve more specific 
hybridization in a certain hybridization condition; for example, the probe can be 
positioned in such a way that the type-specific nucleotides are located in the 
middle part of the probe. For certain probes to retain specificity in other 
hybridization conditions, it may also be preferable to elongate or shorten the 
contiguous HCV sequence and/or to reverse the sense of the probes to allow 
genotype-specific hybridization at a certain preferred temperature or salt 
concentration. However, in some cases, it may be preferable to include inosines or 
mismatching nucleotides to allow genotype-specific hybridization at a certain 
preferred temperature or salt concentration. For example, the probe with SEQ ID 
NO 37, which was able to discriminate between type 4 and 5 isolates in 
tetramethylammonium chloride buffer as described in example 10, was now 
changed into the probe with SEQ ID NO 93 (5'-GAGTGTTGTACAGCCTCC-3') by 
elongation of the contiguous HCV sequence at the 3' end with 2 nucleotides, and 
probe 93 showed a specific reactivity in SSC/SDS hybridization buffer (Figure 
10). The probe with SEQ ID NO 44, which was able to discriminate between type 
4 and 5 isolates in tetramethylammonium chloride buffer as described in example 
10, was now changed into the probe with SEQ ID NO 96 
(5'-GAGTGTCGAACAGCCTC-3') by elongation of the contiguous HCV sequence at 
the 3' end with 1 nucleotide, and probe 96 showed a specific reactivity in 
SSC/SDS hybridization buffer (Figure 10). The antisense probe with SEQ ID NO 
46, which targets positions -132 to -117 and was able to discriminate between type 4 
and 5 isolates in tetramethylammonium chloride buffer as described in example 
10, was now changed into the probe with SEQ ID NO 95 
(5'-TGCCCGGAGATTTGGG-3'), a sense probe which targets positions -126 to -111, 
and probe 95 showed a specific reactivity in SSC/SDS hybridization buffer (Figure 
10). 
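
The relationships between the original and the modified probes can be checked with a short sketch. The helper names are hypothetical; the sequences of probes 93, 95 and 96 are those quoted above, and the sequences of probes 37, 44 and 46 are taken as listed in Table 5. The sketch only compares strings and says nothing about hybridization behaviour.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def added_3prime_bases(original, modified):
    """Number of bases added at the 3' end, or None if not an elongation."""
    if modified.startswith(original):
        return len(modified) - len(original)
    return None


def overlap_length(upstream, downstream):
    """Longest suffix of `upstream` that is also a prefix of `downstream`."""
    for k in range(min(len(upstream), len(downstream)), 0, -1):
        if upstream.endswith(downstream[:k]):
            return k
    return 0


print(added_3prime_bases("GAGTGTTGTACAGCCT", "GAGTGTTGTACAGCCTCC"))  # 37 -> 93
print(added_3prime_bases("GAGTGTCGAACAGCCT", "GAGTGTCGAACAGCCTC"))   # 44 -> 96
sense_46 = reverse_complement("TCTCCGGGCATTGAGC")                    # probe 46
print(sense_46, overlap_length(sense_46, "TGCCCGGAGATTTGGG"))        # vs 95
```

In this check, probes 93 and 96 extend probes 37 and 44 by 2 and 1 nucleotides, and the reverse complement of probe 46 shares its last 10 nucleotides with the start of probe 95, consistent with a shift of 6 positions from -132/-117 to -126/-111.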

This example illustrates the numerous possibilities available to those 
skilled in the art for developing probes targeting the genotype-specific mutations 
as presented in Figure 4, or the genotype-specific mutations that are present in other 
new or still to be discovered genotypes. 

Table 3: Final results of HCV LiPA typing and HCV antibody assays 

Isolate   NS4   NS5   Core   LIA   EIA   type
BR1       3     3     3      1           1a
BR2       3     1     2      1           1a
BR3       0     0     0      0           1a
BR4       1     0     0      9           1a
BR5       3     3     2      1           1a
BR6       3     3     2      1           1a
BR7       9     0     1      1           1a

BR8 

9 

1 

9 


1a 

BR9 

0 

1b 

BR10 

9 


1b 

1 

1 

0 

0 

9 


1b 

1 

0 

0 

9 


1b 

3 

1 


2a 

2 

1 


3b 

1 

9 


3a 

1 


3a 

Table 3: continuation 






Table 5 

Type                 Sequence 5' to 3'               SEQ ID NO
1                    AATTGCCAGGACGACC                5
1/6                  TCTCCAGGCATTGAGC                6
1/6                  A A nmrrir'p A nn A vn A rr'    28
1a/2                 CCL-CvjUAAvjAL^ 1 \J\^ i A
1b                   (jrC i C AU 1 uUU i vjUAvjr A   9Q
1b/3c/5              CCvjCUALrAL. Y I AUri^
2/6                  CGTACAGCCTCCAGGC                32
2a                   CCGGGAAGACTGGGTC                22
2a                   ACCCACTCTATGCCCG                24
2b                   ACCCACTCTATGTCCG                25
2c                   GGACCCAGTCTTCCTG                33
2c                   TGCCTGGTCATTTGGG                34
3a'                  CCGCAAGATCACTAGC                36
3a                   TKTCTGGGTATTGAGC                35
3c                   GAATCGCCGGGTTGAC                54
4/5                  AATYGCCGGGATGACC                17
4a/4b/4c/4d/4g/4h    GAGTGTTGTACAGCCT                37
4e                   GAGTGTTGTGCAGCCT                39
3b/4a/4b/4d/4f       AATCGCCGGGATGACC                38
3b/4a/4d/3c/4f       TTTCCGGGCATTGAGC                19
4e                   AATCGCCGGGACGACC                40
4e                   AATGCCCGGCAATTTG                41
4c*                  AATCGCCGAGATGACC                42
4c                   AATGCTCGGAAATTTG                43
4h                   AATCGCCAGGATGACC                49
4h/4g                TGCCTGGAAATTTGGG                50
4g                   GGAATCGCCAGGACGA                53
4f/4e                AGTYCACCGGAATCGC                52
4f/4g/6/2a/2b/2c     GAGTGTCGTACAGCCT                51
5a/5                 GAGTGTCGAACAGCCT                44
5a*                  AATTGCCGGGATGACC                45
5a/5                 AATTGCCGGGACGACC                47
5a*                  TCTCCGGGCATTGAGC                46
6a/6                 GGGTCCTTTCCATTGG                48